ON THE ROOTS OF A CERTAIN TYPE OF MATRIX


JOHN Z. HEARON 
NATIONAL INSTITUTE OF ARTHRITIS AND METABOLIC DISEASES, 
NATIONAL INSTITUTES OF HEALTH, BETHESDA, MARYLAND 


The matrix of any linear (differential) system formally equivalent to 
the so-called mammillary compartmental system is considered. By sym- 
metrization and appeal to a classical theorem a separation property of the 
roots is proved. Conditions that establish the signs of the roots are 
given. 


There is a class of problems in linear kinetics in which the 
matrix of the set of differential equations is real and of the form 


$$A = \begin{bmatrix} a & r \\ c & D \end{bmatrix} \tag{1}$$

where $a$ is a scalar, $r$ and $c$ are row and column matrices respectively, and $D$ is a diagonal matrix of order $n$ with diagonal elements $d_i$. In most physical problems (some examples are cited below) $a$ and the elements of $D$ are non-positive while the elements of $r$ and $c$ are non-negative.

It is the primary purpose of this note to show that when every term $r_i c_i$ in the sum $rc$ is positive the roots of $A$ are real, and that when the $d_i$ are distinct the roots of $A$ are distinct and separated by the $d_i$. Under the further restriction that $d_i d_j > 0$ for all $i$ and $j$, conditions are given for the roots to be non-positive or non-negative.

The separation property mentioned above has previously been 
demonstrated by Smith and Morales who assumed the roots to be 
real and distinct (1944) and for a rather special case by Sheppard 
and Householder (1950). These cases are mentioned in what follows. 

Consider the nonsingular matrix

$$P = \begin{bmatrix} 1 & 0 \\ 0 & Q \end{bmatrix} \tag{2}$$

and the similarity transformation

$$B = P^{-1} A P = \begin{bmatrix} a & rQ \\ Q^{-1}c & Q^{-1}DQ \end{bmatrix}. \tag{3}$$

We seek a matrix $Q$ such that $B$ is symmetric. If $Q$ is diagonal, $Q = \mathrm{diag}(q_1, q_2, \cdots, q_n)$, then $Q^{-1}DQ = D$ and the condition for the symmetry of $B$ is

$$Q^{-1}c = Qr', \tag{4}$$

where $r'$ is the transpose of $r$. If $r_j = 0$ only if $c_j = 0$, then (4) determines the $q_j$ as

$$q_j = \sqrt{c_j / r_j} \tag{5}$$

for all $j$ for which $r_j \neq 0$, and $q_j$ is non-zero but otherwise arbitrary for all $j$ for which $r_j = 0$. If further, for the non-zero elements of $r$ and $c$, $r_j c_j > 0$, then $Q$ and hence $B$ is a real matrix.

Thus under the conditions that zero elements occur in corresponding positions in $r$ and $c$ and that the corresponding non-zero elements of $r$ and $c$ are of like sign we have produced a real symmetric matrix $B$ such that

$$P^{-1} A P = B = \begin{bmatrix} a & b' \\ b & D \end{bmatrix} \tag{6}$$

where $b = Q^{-1}c = Qr'$. Since $A$ and $B$ have the same roots and $B$, being real and symmetric, has real roots, it has been proved that the roots of $A$ are real.
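The symmetrization can be checked numerically. The following is a minimal Python sketch, not part of the original note: the function name `symmetrize`, the list-based representation of $r$, $c$ and the diagonal of $D$, and the sample values are all assumptions made for illustration. It computes the $q_j$ of equation (5), forms the top row $rQ$ and the first column $Q^{-1}c$ of $B$ separately, and lets one confirm that they coincide.

```python
import math


def symmetrize(a, r, c, d):
    """Return (B, q) with B = P^{-1} A P for A = [[a, r], [c, diag(d)]], P = diag(1, Q).

    Hypothetical helper: r, c and d are plain lists of equal length.
    """
    q = []
    for rj, cj in zip(r, c):
        if rj == 0 and cj == 0:
            q.append(1.0)                 # any non-zero value is allowed here
        elif rj * cj > 0:
            q.append(math.sqrt(cj / rj))  # equation (5)
        else:
            raise ValueError("need r_j c_j > 0, or r_j = c_j = 0")
    top_row = [rj * qj for rj, qj in zip(r, q)]    # r Q
    first_col = [cj / qj for cj, qj in zip(c, q)]  # Q^{-1} c
    n = len(d)
    B = [[a] + top_row]
    for i in range(n):
        B.append([first_col[i]] + [d[i] if j == i else 0.0 for j in range(n)])
    return B, q


if __name__ == "__main__":
    B, q = symmetrize(-3.0, [1.0, 2.0, 0.5], [4.0, 0.5, 2.0], [-1.0, -2.0, -4.0])
    for row in B:
        print(row)
```

The diagonal block is left unchanged because a diagonal $Q$ commutes with the diagonal $D$.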

The symmetrization (6) is a special case of a more general proposition (Hearon, 1953): A real matrix $A$ of order $N$ is symmetrizable by a real, diagonal, similarity transformation if (i)¹ the number of non-zero off-diagonal elements is not more than $2(N-1)$, (ii) the zero off-diagonal elements are symmetrically disposed and (iii) for non-zero off-diagonal elements $a_{ij}a_{ji} > 0$. The derivation has been presented here de novo because of the simplicity which obtains when the matrix takes the form (1).

¹If condition (i) is dropped it must be replaced by "cyclic conditions" of the type $a_{ij}a_{jk}a_{ki} = a_{ji}a_{kj}a_{ik}$, which have been derived and discussed in some detail elsewhere (Hearon, 1953). It has been shown there that any matrix such that $a_{ij} = 0$ only if $a_{ji} = 0$, and for which the cyclic relations are trivially satisfied, has precisely $2(N-1)$ non-zero off-diagonal elements.

By a known theorem (Browne, 1930; Hamburger and Grimshaw, 1951) the roots of a Hermitian matrix of order $n+1$ are separated by the roots of a leading principal minor of order $n$, provided these latter roots are distinct. In particular with reference to the matrix $B$ above, the roots of $D$, which are the elements $d_i$, separate the roots of $B$, which are the roots of $A$, provided the $d_i$ are distinct. If the roots of $A$ and the $d_i$ are numbered so that $\lambda_1 > \lambda_2 > \cdots > \lambda_{n+1}$ and $d_1 > d_2 > \cdots > d_n$, we then have the separation property

$$\lambda_1 > d_1 > \lambda_2 > d_2 > \cdots > \lambda_n > d_n > \lambda_{n+1}. \tag{7}$$
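The separation property also suggests a way to locate the roots. The sketch below is an illustration under stated assumptions, not the author's procedure: it uses the standard bordered-determinant factorization $\det(A - \lambda I) = \prod_i (d_i - \lambda)\, f(\lambda)$ with $f(\lambda) = a - \lambda - \sum_i r_i c_i/(d_i - \lambda)$, assumes distinct $d_i$ and $r_i c_i > 0$ for every $i$, and bisects $f$ once in each gap left by the $d_i$. The names `secular` and `roots_by_separation` are hypothetical.

```python
def secular(lam, a, r, c, d):
    """f(lam) = a - lam - sum_i r_i c_i / (d_i - lam); its zeros are the roots of A."""
    return a - lam - sum(ri * ci / (di - lam) for ri, ci, di in zip(r, c, d))


def roots_by_separation(a, r, c, d, iterations=200):
    """Return (roots, ds), both in descending order.

    Assumes the d_i are distinct and r_i c_i > 0 for all i, so that exactly
    one root lies in each of the n + 1 gaps delimited by the d_i.
    """
    ds = sorted(d, reverse=True)
    spread = 1.0 + sum(ri * ci for ri, ci in zip(r, c))
    upper = max(a, ds[0]) + spread   # f(upper) < 0 by construction
    lower = min(a, ds[-1]) - spread  # f(lower) > 0 by construction
    edges = [upper] + ds + [lower]
    roots = []
    for hi, lo in zip(edges, edges[1:]):
        left, right = lo, hi
        for _ in range(iterations):
            mid = 0.5 * (left + right)
            if mid == left or mid == right:
                break                # interval has collapsed to adjacent floats
            # f is strictly decreasing inside each gap, so a positive value
            # means the root lies to the right of mid.
            if secular(mid, a, r, c, d) > 0:
                left = mid
            else:
                right = mid
        roots.append(0.5 * (left + right))
    return roots, ds


if __name__ == "__main__":
    roots, ds = roots_by_separation(-3.0, [1.0, 2.0, 0.5], [4.0, 0.5, 2.0],
                                    [-1.0, -2.0, -4.0])
    print("roots:", roots)
    print("d_i:  ", ds)
```

Only midpoints are evaluated, so the poles of $f$ at the $d_i$ are never touched. The sum of the returned roots can be compared with the trace $a + \sum_i d_i$ as a consistency check.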


We now consider the sign of the $\lambda_i$. In addition to the conditions leading to the separation (7) we assume that $D$ is nonsingular; then if the $d_i$ are all of like sign, $n$ of the $\lambda_i$ are of that sign and the remaining root (in fact either the maximal or minimal root) has the sign of $m = a - rD^{-1}c$. For if $D^{-1}$ exists the determinant of $A$ is given by²

$$|A| = |D|\, m = \lambda_1 \lambda_2 \cdots \lambda_{n+1}. \tag{8}$$

If, for all $i$, $d_i < 0$ then, by (7), $\lambda_2, \lambda_3, \cdots, \lambda_{n+1}$ are negative and it follows from (8) that $\lambda_1$ and $m$ must have the same sign. Similarly, if, for all $i$, $d_i > 0$ then $\lambda_1, \lambda_2, \cdots, \lambda_n$ are positive and (8) requires that $\lambda_{n+1}$ and $m$ be of like sign. If $m = 0$ there is one zero root; thus $d_i < 0$, $m \le 0$ implies $\lambda_i \le 0$ and conversely, while $d_i > 0$, $m \ge 0$ implies $\lambda_i \ge 0$ and conversely.
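The determinant identity $|A| = |D|\,m$ can be verified exactly on a small example. This is a sketch with hypothetical helper names (`bordered`, `det`, `m_value`) and made-up sample values; exact rational arithmetic from the standard `fractions` module is used so that the comparison involves no rounding.

```python
from fractions import Fraction


def bordered(a, r, c, d):
    """Build A = [[a, r], [c, diag(d)]] as a list of rows of Fractions."""
    n = len(d)
    A = [[Fraction(a)] + [Fraction(x) for x in r]]
    for i in range(n):
        A.append([Fraction(c[i])]
                 + [Fraction(d[i]) if j == i else Fraction(0) for j in range(n)])
    return A


def det(M):
    """Determinant by Gaussian elimination with row swaps (exact for Fractions)."""
    M = [row[:] for row in M]
    n = len(M)
    result = Fraction(1)
    for k in range(n):
        pivot = next((i for i in range(k, n) if M[i][k] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != k:
            M[k], M[pivot] = M[pivot], M[k]
            result = -result          # a row swap flips the sign
        result *= M[k][k]
        for i in range(k + 1, n):
            factor = M[i][k] / M[k][k]
            for j in range(k, n):
                M[i][j] -= factor * M[k][j]
    return result


def m_value(a, r, c, d):
    """m = a - r D^{-1} c for diagonal D with non-zero entries d_i."""
    return Fraction(a) - sum(Fraction(ri) * Fraction(ci) / Fraction(di)
                             for ri, ci, di in zip(r, c, d))


if __name__ == "__main__":
    a, r, c, d = -3, [1, 2, Fraction(1, 2)], [4, Fraction(1, 2), 2], [-1, -2, -4]
    print("m   =", m_value(a, r, c, d))
    print("|A| =", det(bordered(a, r, c, d)))
```

In this example all $d_i$ are negative and $m$ is positive, so by the argument above the maximal root is positive while the other roots are negative.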

In a variety of applications $a$ and the $d_i$ are negative while the elements of $r$ and $c$ are positive. Given this it is usually quite simple to establish the sign of $m$ as the following examples show.

²This result can be obtained as a special case of a theorem given by Gantmacher (1959, pp. 45-6) or from the Cauchy formula for a bordered determinant.

For any problem formally equivalent to that of the "mammillary" compartmental system (e.g., Sheppard and Householder, 1950) the relevant matrix takes the form (1). The matrix of the differential equations for the specific activities in a mammillary system with $n$ peripheral compartments is

$$A = \begin{bmatrix}
-(\beta_0 + \sum \lambda_i) & \lambda_1 & \lambda_2 & \cdots & \lambda_n \\
\alpha_1 & -(\alpha_1 + \beta_1) & 0 & \cdots & 0 \\
\alpha_2 & 0 & -(\alpha_2 + \beta_2) & \cdots & 0 \\
\vdots & \vdots & \vdots & \ddots & \vdots \\
\alpha_n & 0 & 0 & \cdots & -(\alpha_n + \beta_n)
\end{bmatrix}$$

where the $\lambda_i$, $\alpha_i$ and $\beta_i$ are positive and $\beta_i$ is the excretion rate from the $i$th compartment. (Here the $\lambda_i$ denote rate constants of the system, not the roots of $A$.) When the $\beta_i = 0$ the system is closed, $rD^{-1}c = -\sum \lambda_i$, and $m = 0$. This is the case treated by Sheppard and Householder (1950). In general $m = -(\beta_0 + \sum \lambda_i) + \sum \alpha_i \lambda_i/(\alpha_i + \beta_i) \le -(\beta_0 + \sum \lambda_i) + \sum \lambda_i \le 0$. It is a fact that the above matrix
possesses a dominant diagonal and, given that the roots are real, 
that they are negative can be proved by applying a well known 
theorem (e.g., Brauer, 1946; see Hearon, 1953, for such an appli- 
cation in the chemical kinetic case) or by applying a known theorem 
(Hearon, 1953) to the quadratic form of B. There are several situ- 
ations in which the matrix does not have a dominant diagonal. For 
example, for the problem considered by Smith and Morales (1944) 
the matrix takes the form 


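$$A = \begin{bmatrix}
-\dfrac{R + \sum \beta_i}{V_0} & \dfrac{\alpha_1\beta_1}{V_0} & \dfrac{\alpha_2\beta_2}{V_0} & \cdots & \dfrac{\alpha_n\beta_n}{V_0} \\
\dfrac{\beta_1}{V_1} & -\dfrac{\alpha_1\beta_1}{V_1} & 0 & \cdots & 0 \\
\dfrac{\beta_2}{V_2} & 0 & -\dfrac{\alpha_2\beta_2}{V_2} & \cdots & 0 \\
\vdots & \vdots & \vdots & \ddots & \vdots \\
\dfrac{\beta_n}{V_n} & 0 & 0 & \cdots & -\dfrac{\alpha_n\beta_n}{V_n}
\end{bmatrix}$$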


where $R$, $\alpha_i$, $\beta_i$ and $V_i$ are positive. Here $rD^{-1}c = -\sum \beta_i / V_0$ and $m = -R/V_0 < 0$.
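Returning to the mammillary case, the sign of $m$ is easy to evaluate directly. The sketch below assumes the reading $r_i = \lambda_i$, $c_i = \alpha_i$, $d_i = -(\alpha_i + \beta_i)$ and $a = -(\beta_0 + \sum \lambda_i)$ for the mammillary matrix; the function name `mammillary_m` and the numerical rates are hypothetical.

```python
from fractions import Fraction


def mammillary_m(beta0, lam, alpha, beta):
    """m = a - r D^{-1} c for a mammillary system, in exact arithmetic.

    lam, alpha and beta are sequences of equal length holding the rate
    constants of the peripheral compartments (assumed alpha_i > 0).
    """
    lam = [Fraction(x) for x in lam]
    alpha = [Fraction(x) for x in alpha]
    beta = [Fraction(x) for x in beta]
    a = -(Fraction(beta0) + sum(lam))
    # r D^{-1} c = sum_i lam_i * alpha_i / d_i with d_i = -(alpha_i + beta_i)
    r_dinv_c = sum(l * al / -(al + b) for l, al, b in zip(lam, alpha, beta))
    return a - r_dinv_c


if __name__ == "__main__":
    print("closed system:", mammillary_m(0, [1, 2], [3, 4], [0, 0]))
    print("open system:  ", mammillary_m(1, [1, 2], [3, 4], [1, 1]))
```

With all excretion rates zero the result is exactly zero, and any positive excretion rate makes it negative, in line with the inequality given for $m$.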


GAS TRANSPORT THROUGH HEMOGLOBIN SOLUTION

R. E. COLLINS

A mathematical theory is developed to explain the observed enhancement of O₂ transport through solutions by hemoglobin. At high partial pressures of O₂, ordinary diffusion through the solvent accounts for all transport of O₂, but at low partial pressures the transport may be increased manyfold by the presence of hemoglobin. This phenomenon is explained and its possible role in living organisms is discussed. The theory also indicates a new method of determining dissociation curves from diffusion experiments.


Two recent articles in Science have described several interest- 
ing and perhaps important features of gas transport through hemo- 
globin solutions. Scholander (1960) and later Hemmingsen and 
Scholander (1960) reported observation of an enhanced diffusion of 
oxygen through solutions containing hemoglobin. Briefly, these 
experiments are reviewed as follows: 

A millipore membrane saturated with a hemoglobin solution 
separated two gas chambers. In the first instance, one chamber 
was maintained at near vacuum while oxygen at a fixed pressure 
filled the other chamber (Hemmingsen and Scholander, 1960). In 
the second case, oxygen at different pressures was maintained in 
the two chambers (Scholander, 1960). Similar experiments were
performed using nitrogen in place of oxygen. In both cases, the 
flux of gas through the solution-filled membrane was measured. 

The results of these measurements revealed that while the 
nitrogen flux was always simply proportional to the difference in 
nitrogen partial pressures of the two chambers, as required by 
Fick’s law of diffusion, such was not the case for oxygen. It was 
found that the rate of transport of oxygen was proportional to the
difference in oxygen partial pressures of the two chambers only
when the partial pressure of oxygen in both chambers was ap- 
preciable. When the oxygen partial pressure in the low pressure 
chamber was very small, the rate of transport was several fold 
greater for the same difference in oxygen partial pressure. 

In particular the first reported results (Hemmingsen and Scholander, 1960) showed that when the low pressure chamber was maintained at near vacuum, the O₂ transport could be represented by: total oxygen flux = 0.56 × nitrogen flux + constant, where 0.56 is the O₂/N₂ flux ratio in hemoglobin-free solution for the same difference in pressure. The second reported results (Scholander, 1960) revealed that this augmentation of transport was not constant but was reduced by a back pressure of oxygen. Furthermore, the augmentation was shown to be proportional to the hemoglobin concentration of the solution, except at very high concentration.

Scholander pointed out that this additional transport mediated by 
the hemoglobin could be due to the random Brownian motion of the 
hemoglobin molecules. Thus a hemoglobin molecule could move 
from a region of high oxygen concentration to a region of low oxy- 
gen concentration. In doing so, it would give up some of its as- 
sociated oxygen which would be taken up by hemoglobin molecules 
in the low concentration region. This, of course, would be ac- 
companied by hemoglobin molecules moving in the opposite direc- 
tion. In this manner a ‘‘bucket-brigade’’ would mediate oxygen 
transport. That thermal or Brownian motion plays an important 
role in the transport of oxygen was demonstrated by the fact that 
oxygen transport was greatly reduced when the solution was so- 
lidified with agar gel. Indeed the theory presented below indi- 
cates that this is the mechanism of hemoglobin transport of oxy- 
gen, as well as of other gases which associate with hemoglobin. 

Mathematical Theory. We begin by observing that, for a gas 
which associates with hemoglobin, the quantity of gas associated 
at equilibrium can be characterized by a dissociation curve. Thus, 
in a solution, the hemoglobin will be partially saturated with the 
gas (oxygen) when the solvent contains a certain partial pressure 
(concentration) of gas (oxygen). The degree of saturation, or 
fractional saturation, we denote by S(C), where S is a fraction be- 
tween zero and one and C is the gas concentration in the solvent. 

Thus, if $n$ is the number of moles of gas associated with one mole of hemoglobin at complete saturation and $X$ is the number of moles of hemoglobin* per volume of solution, we have

$$Z = nXS(C) \tag{1}$$

where $Z$ is the number of moles of gas associated with hemoglobin per volume of solution for gas concentration $C$ in the solvent.

Due to the random thermal or Brownian motion a hemoglobin molecule will, on the average, travel a distance $\lambda$ in a particular direction during a time $\tau$. To see the effect of this motion on gas transport consider two equal volumes, (a) and (b), separated by a plane aa of area $A$ as shown in Figure 1.

[Figure 1: positions $\xi$ and $\xi + \lambda$ marked along the $\xi$ axis.]

FIGURE 1. Showing the domains in the field of transport considered in the text.

Since steady diffusion of gas through the solvent in the $+\xi$ direction is assumed, the two volumes have a different concentration of gas in the solvent, that is $C = C(\xi)$.

Now on the average during a time $\tau$, exactly one-half of the hemoglobin molecules in volume (a) will cross the area $A$ into the volume (b), while the other half move in the opposite direction. Similarly, during this time one-half of the hemoglobin molecules in (b) will cross into volume (a). If these volumes are volumes of millipore filter having a porosity $\phi$ = void volume/bulk volume, then each volume contains a volume of solution equal to $\phi\lambda A$. During the time $\tau$, the number of moles of gas moving from (a) to (b) by being associated with hemoglobin is $\phi\lambda A Z_a/2$. Here $Z_a$ is the mean value of $Z$ in (a). Similarly, the number of moles of gas moving from (b) to (a) "on the hemoglobin" during $\tau$ is $\phi\lambda A Z_b/2$, where $Z_b$ is the average value of $Z$ in (b). Thus the net rate of transfer of gas in the $+\xi$ direction by thermal motion of hemoglobin is

$$\dot N_G' = \frac{\phi\lambda A}{2\tau}\,(Z_a - Z_b). \tag{2}$$

If $\tau$ is sufficiently small, i.e., in the limit as $\tau \to 0$, then $\lambda \to 0$ and the average values $Z_a$ and $Z_b$ can be replaced by the values at the midpoints of the volumes, $\xi$ and $\xi + \lambda$, thus:

$$Z_a = Z(\xi), \qquad Z_b = Z(\xi + \lambda) = Z(\xi) + \lambda\,\frac{\partial Z}{\partial \xi}, \tag{3}$$

and then

$$\dot N_G' = -A\phi\,\frac{\lambda^2}{2\tau}\,\frac{\partial Z}{\partial \xi}. \tag{4}$$

*Note that here we call both oxygenated and non-oxygenated molecules just hemoglobin.

Now $\lambda^2/2\tau$ is just the diffusion coefficient for hemoglobin diffusion through the solution. This is easily seen by considering the case of uniform gas concentration throughout the solution. In this case $C$ and hence $S(C)$ are constant.

Then, from equation (1),

$$\frac{\partial Z}{\partial \xi} = nS\,\frac{\partial X}{\partial \xi} \tag{5}$$

and equation (4) becomes

$$\dot N_G' = -A\phi\, nS\,\frac{\lambda^2}{2\tau}\,\frac{\partial X}{\partial \xi}. \tag{6}$$

But since $nS$ is the number of moles of gas per mole of hemoglobin, we have for the rate of diffusion of hemoglobin molecules in a free solution, i.e., $\phi = 1$,

$$\dot N_{Hb} = -A\,\frac{\lambda^2}{2\tau}\,\frac{\partial X}{\partial \xi}, \tag{7}$$

which is just Fick's law of diffusion with $\lambda^2/2\tau$ being the diffusion coefficient for hemoglobin molecules. Thus we denote $\lambda^2/2\tau$ by

$$D_{Hb} = \frac{\lambda^2}{2\tau}. \tag{8}$$

The theory of Brownian motion (Glasstone, 1946, p. 259) shows that $D_{Hb}$ is related to the temperature and viscosity of the solvent, at low hemoglobin concentration, by

$$D_{Hb} = \frac{RT}{6\pi\eta r N_A} \tag{9}$$

where $R$ is the gas constant per mole, $T$ is absolute temperature, $\eta$ is solvent viscosity, $N_A$ is Avogadro's number and $r$ is the effective radius of the hemoglobin molecule. Thus we have for transport of gas by hemoglobin within the millipore membrane,**

$$\dot N_G' = -A\phi\, D_{Hb}\,\frac{\partial}{\partial \xi}\,[nXS(C)], \tag{10}$$

or, when equation (9) is valid,

$$\dot N_G' = -A\phi\,\frac{RT}{6\pi\eta r N_A}\,\frac{\partial}{\partial \xi}\,[nXS(C)]. \tag{11}$$
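To give a feeling for the magnitude implied by equation (9), here is a small Python sketch. The temperature, viscosity and effective radius used in the example are illustrative assumptions (roughly water near room temperature and a radius of a few nanometres); they are not values taken from the paper, and the function name is hypothetical.

```python
import math

R_GAS = 8.314462618    # gas constant, J / (mol K)
AVOGADRO = 6.02214076e23  # Avogadro's number, 1 / mol


def hemoglobin_diffusion_coefficient(temperature, viscosity, radius):
    """Equation (9): D_Hb = R T / (6 pi eta r N_A), in SI units (m^2 / s).

    temperature in K, viscosity in Pa s, radius in m.
    """
    return R_GAS * temperature / (6.0 * math.pi * viscosity * radius * AVOGADRO)


if __name__ == "__main__":
    # Assumed inputs: 293.15 K, 1.0e-3 Pa s, effective radius 3.1e-9 m.
    d_hb = hemoglobin_diffusion_coefficient(293.15, 1.0e-3, 3.1e-9)
    print(f"D_Hb = {d_hb:.3e} m^2/s")
```

With these assumed inputs the result is roughly $7 \times 10^{-11}$ m²/s, and doubling the viscosity halves it, which is the dependence invoked later for the effect of gelation.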


**Strictly speaking, the diffusion coefficient is modified by the millipore membrane.

Of course, gas is also transported simply by diffusion within the solvent. This is represented by Fick's law as

$$\dot N_G'' = -A\phi\, D_G\,\frac{\partial C}{\partial \xi} \tag{12}$$

where $D_G$ is the diffusion coefficient for the gas molecules in the solution. Then the total rate of transport of gas across the area $A$ is

$$\dot N_G = -A\phi \left\{ D_G\,\frac{\partial C}{\partial \xi} + D_{Hb}\,\frac{\partial}{\partial \xi}\,[nXS(C)] \right\}. \tag{13}$$

Now it is reasonable to assume that, in the experiments previously described, the number of hemoglobin molecules per unit volume of solution remains uniform throughout the solution, i.e., $X$ = constant, thus:

$$\dot N_G = -A\phi \left\{ D_G + D_{Hb}\, nX\,\frac{dS(C)}{dC} \right\} \frac{\partial C}{\partial \xi}. \tag{14}$$

This shows that the transport of gas by hemoglobin has the effect of replacing the diffusion coefficient for gas, $D_G$, by the effective value,

$$D_e = D_G + D_{Hb}\, nX\,\frac{dS(C)}{dC}, \tag{15}$$

which is a function of gas concentration, $C$.
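The concentration dependence of the effective coefficient in equation (15) can be illustrated with a short sketch. Everything numerical here is an assumption: the hyperbolic curve $S(C) = C/(C_{1/2} + C)$ is chosen only because its slope has a simple closed form, not as a model of hemoglobin, and the parameter values and function names are hypothetical.

```python
def hyperbolic_saturation_slope(C, C_half):
    """dS/dC for the illustrative curve S(C) = C / (C_half + C)."""
    return C_half / (C_half + C) ** 2


def effective_diffusion(C, D_G, D_Hb, n, X, dS_dC):
    """Equation (15): D_e = D_G + D_Hb * n * X * dS/dC evaluated at C.

    dS_dC is any callable returning the slope of the dissociation curve.
    """
    return D_G + D_Hb * n * X * dS_dC(C)


if __name__ == "__main__":
    def slope(C):
        return hyperbolic_saturation_slope(C, 0.03)

    # Assumed values: D_G = 2e-9, D_Hb = 7e-11 (m^2/s), n = 4, X = 5 (mol/m^3).
    for C in (0.0, 0.03, 0.3, 3.0):
        D_e = effective_diffusion(C, 2e-9, 7e-11, 4, 5.0, slope)
        print(f"C = {C:5.2f}  D_e = {D_e:.3e}")
```

For this illustrative curve the effective coefficient is largest at low gas concentration and falls back toward $D_G$ as the carrier saturates.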

On the other hand since, for steady state, $\dot N_G$ must be independent of $\xi$, equation (14) can be integrated over $\xi$ from the high pressure side, $\xi = 0$, to the low pressure side, $\xi = L$. Thus from (13) we find the following expression for the steady state:

$$\dot N_G = \frac{A\phi}{L} \left\{ D_G\,(C_0 - C_L) + D_{Hb}\, nX\,[S(C_0) - S(C_L)] \right\}. \tag{16}$$

Here $C_0$ and $C_L$ are the values of $C$ in the solution at $\xi = 0$ and $\xi = L$, respectively.

Now the concentrations $C_0$ and $C_L$ are not the same as the concentrations in the respective gas chambers. If the gas obeys Henry's law (Glasstone, 1946, p. 697) and Dalton's law of partial pressure, then,

$$\frac{C_0}{p_G(0)} = \frac{C_L}{p_G(L)} = k_G = \text{constant}, \tag{17}$$

where $p_G$ denotes the partial pressure in the appropriate gas chamber. However deviations from this law are common for many gases, particularly CO₂.

Comparison of Theory and Experimental Results. That the mathematical theory outlined above describes the experimental results discussed previously is shown as follows: First, note that in any plane, $\xi$ = constant, the value $S(C)$ is assumed to be the equilibrium value corresponding to the local value of $C$. Thus the $S$, $C$ relationship is simply that characterized by the appropriate dissociation curve.

To apply this theory to the nitrogen-oxygen diffusion experiments we write equation (16) first for N₂, noting that N₂ does not associate with hemoglobin, thus:

$$\dot N_{N_2} = \frac{A\phi}{L}\, D_{N_2} k_{N_2}\,[p_{N_2}(0) - p_{N_2}(L)] \tag{18}$$

where equation (17) has been used; then for O₂,

$$\dot N_{O_2} = \frac{A\phi}{L}\, D_{O_2} k_{O_2}\,[p_{O_2}(0) - p_{O_2}(L)] + \frac{A\phi}{L}\, nX D_{Hb}\,[S'(p_{O_2}(0)) - S'(p_{O_2}(L))]. \tag{19}$$

Here $S'$ denotes the fractional saturation of hemoglobin with O₂ as a function of O₂ partial pressure (in adjacent air) rather than as a function of O₂ concentration in the solvent. This function, $S'$, is obtained by replacing $C$ by $k_{O_2} p_{O_2}$ in the function $S(C)$.
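The behaviour described by the steady-state flux expression can be explored with a few lines of Python. This is a sketch in arbitrary units under loud assumptions: the Hill-type curve used for $S'$, its half-saturation pressure of 10 mm Hg and exponent 2.7, and the lumped coefficients `Dk` (for $Dk$ of the gas), `nXD_Hb` and `geometry` (for $A\phi/L$) are all hypothetical stand-ins, not values from the paper.

```python
def hill_saturation(p, p_half=10.0, h=2.7):
    """Illustrative saturation curve S'(p) = p^h / (p_half^h + p^h)."""
    if p <= 0:
        return 0.0
    x = (p / p_half) ** h
    return x / (1.0 + x)


def steady_flux(p0, pL, Dk, nXD_Hb, geometry=1.0, saturation=hill_saturation):
    """Steady-state flux: solvent diffusion plus carrier-mediated transport.

    geometry * (Dk * (p0 - pL) + nXD_Hb * (S'(p0) - S'(pL)))
    """
    return geometry * (Dk * (p0 - pL)
                       + nXD_Hb * (saturation(p0) - saturation(pL)))


if __name__ == "__main__":
    # Same pressure difference (80 mm Hg), different back pressures.
    for p0, pL in ((80.0, 0.0), (100.0, 20.0), (140.0, 60.0)):
        print(p0, pL, steady_flux(p0, pL, Dk=1.0, nXD_Hb=50.0))
```

Setting `nXD_Hb` to zero reproduces the nitrogen case, where the flux depends only on the pressure difference; with a carrier present the flux for the same difference is largest when the low pressure side is near vacuum.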

Now for low CO₂ concentration, and low pH, the oxyhemoglobin dissociation curve rises steeply from $S' = 0$ at $p_{O_2} = 0$ and quickly approaches $S' = 1$ as an asymptote at relatively small values of $p_{O_2}$. Thus, for $p_{O_2}(0) \gg 20$ mm Hg, $S'$ is approximately unity. Considering then a sequence of experiments in which $p_{O_2}(0)$ is sufficiently large we have, for equal pressure differences of O₂ and N₂:

$$\dot N_{O_2} = \frac{D_{O_2} k_{O_2}}{D_{N_2} k_{N_2}}\,\dot N_{N_2} + \frac{A\phi}{L}\, nX D_{Hb}\,[1 - S'(p_{O_2}(L))]. \tag{20}$$

This equation describes almost exactly a sequence of experiments as reported by Hemmingsen and Scholander (1960, see Figure 3). In this sequence of experiments the difference in gas pressure across the membrane has a fixed value of 80 mm Hg. The pressures in both chambers were varied, however, that in the high pressure chamber ranging from 80 mm Hg to 140 mm Hg, and that in the low pressure chamber from essentially zero to 60 mm Hg. Thus in all cases $S'(p_{O_2}(0))$ is greater than about 0.80 while $S'(p_{O_2}(L))$ is always less than the high pressure value.

It is evident from equation (20) that the maximum value of $\dot N_{O_2}$ occurs when $p_{O_2}(0) \gg 20$ mm Hg and $p_{O_2}(L) = 0$. In this case

$$(\dot N_{O_2})_{\max} = \frac{D_{O_2} k_{O_2}}{D_{N_2} k_{N_2}}\,\dot N_{N_2} + \frac{A\phi}{L}\, nX D_{Hb}, \tag{21}$$

which agrees with the reported results. In particular the reported results yielded

$$(\dot N_{O_2})_{\max} = 0.56\,\dot N_{N_2} + \text{constant}, \tag{22}$$

as mentioned earlier.

In every respect this theory agrees with the reported experimental observations. Thus the augmentation of O₂ transport is proportional to the hemoglobin concentration, $X$, up to concentrations great enough to modify the value of $D_{Hb}$. Effects of pH, temperature, CO₂, etc., are also implied through the dependence of $D_{Hb}$ and the dissociation curve on such factors. The reported effect of gelation of the solution is indicated. Thus according to equation (9) $D_{Hb}$ decreases as the viscosity of the solvent increases; this then predicts a reduction of O₂ transport.

Discussion and Applications. An important application of the theory presented here would be to the determination of dissociation curves. Thus consider a millipore membrane filled with a hemoglobin solution and separating two gas filled chambers. Suppose the gas in the two chambers to be either N₂ or O₂.

Let several diffusion measurements be made, always with the same pressure differential of N₂ and O₂. However, always have the $p_{O_2}$ on the high pressure side sufficiently high to consider $S' = 1$ on this side of the membrane.

Also, prior to each run, introduce a fixed amount of CO₂, always the same to both chambers. Thus $p_{CO_2}$ is the same on both sides of the membrane and is constant throughout the series.

Then we have for the series of measurements,

$$\dot N_{N_2} = \text{constant}, \tag{23}$$

and for $p_{O_2} = 0$ at $\xi = L$, $p_{O_2} \gg 80$ mm Hg at $\xi = 0$,

$$\dot N_{O_2} = (\dot N_{O_2})_{\max} = \alpha\,\dot N_{N_2} + \frac{A\phi}{L}\, nX D_{Hb}, \tag{24}$$

where $D_{O_2} k_{O_2} / D_{N_2} k_{N_2} = \alpha$ can be determined; and for other $p_{O_2}$ at $\xi = L$,

$$\dot N_{O_2} = (\dot N_{O_2})_{\max} - \frac{A\phi}{L}\, nX D_{Hb}\, S'(p_{O_2}) \tag{25}$$

where $p_{O_2}$ is on the low pressure side, $\xi = L$. Thus we have

$$S'(p_{O_2}) = \frac{(\dot N_{O_2})_{\max} - \dot N_{O_2}(p_{O_2})}{(\dot N_{O_2})_{\max} - \alpha\,\dot N_{N_2}} \tag{26}$$

which can be used to calculate points on the dissociation curve corresponding to the particular $p_{O_2}$ employed.

We might note that for the same difference in $p_{O_2}$ but with both chambers having very high $p_{O_2}$, $S' = 1$ on both sides, then

$$(\dot N_{O_2})_{\min} = \alpha\,\dot N_{N_2} \tag{27}$$

or

$$(\dot N_{O_2})_{\max} - (\dot N_{O_2})_{\min} = \frac{A\phi}{L}\, nX D_{Hb}. \tag{28}$$

Hence N₂ can be eliminated from the system and only O₂ employed, but always with the fixed $p_{CO_2}$ in both chambers; thus in either case we have,

$$S'(p_{O_2}) = \frac{(\dot N_{O_2})_{\max} - \dot N_{O_2}(p_{O_2})}{(\dot N_{O_2})_{\max} - (\dot N_{O_2})_{\min}} \tag{29}$$

which is a simple way to calculate $S'$ versus $p_{O_2}$ from diffusion measurements.
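The proposed measurement procedure amounts to a one-line normalization of the observed fluxes. The following sketch shows it in Python; the function names and the flux values in the example are hypothetical, and the fluxes are in arbitrary but consistent units.

```python
def saturation_from_fluxes(flux_max, flux_min, flux_at_p):
    """S'(p) = (N_max - N(p)) / (N_max - N_min).

    flux_max: O2 flux with the low pressure side at zero pressure.
    flux_min: O2 flux with both sides at very high pressure (S' = 1 on both).
    flux_at_p: O2 flux with back pressure p on the low pressure side.
    All three are taken at the same pressure difference.
    """
    span = flux_max - flux_min
    if span <= 0:
        raise ValueError("flux_max must exceed flux_min")
    return (flux_max - flux_at_p) / span


def dissociation_points(measurements, flux_max, flux_min):
    """Map a list of (back_pressure, flux) pairs to (back_pressure, S') pairs."""
    return [(p, saturation_from_fluxes(flux_max, flux_min, flux))
            for p, flux in measurements]


if __name__ == "__main__":
    # Made-up fluxes for illustration only.
    data = [(0.0, 130.0), (5.0, 124.0), (10.0, 105.0), (60.0, 80.5)]
    for p, s in dissociation_points(data, flux_max=130.0, flux_min=80.0):
        print(p, round(s, 3))
```

Because only flux differences and their ratio enter, the geometric factor $A\phi/L$ and the product $nXD_{Hb}$ need not be known separately.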

While the theory described has accounted for all features of enhanced transport of gases, O₂, CO, or CO₂, which associate with hemoglobin, it in itself does not indicate the role of this mechanism within living organisms. Since the dissociation curves for O₂ and CO₂ are similar in form, it is evident that the mechanism is the same for both of these gases within the living organism. In particular, transport of gases within the red blood cell should be described by this theory.

Referring to equation (15) for the effective diffusion coefficient 
in a region of uniform hemoglobin concentration, it is evident that 
in regions of low concentration of the gas in question the transport 
of the gas is much greater for a given gradient of concentration 
than in a region of high concentration. This follows since $dS/dC$ decreases rapidly with increasing $C$. Thus an important role of this mechanism in living organisms could be to maintain O₂ and CO₂ transport even with lowered gradients of gas concentration. Thus oxygenation of hemoglobin in red blood cells should be more rapid at low oxygen partial pressures in blood plasma. It could also mediate an equitable distribution of gases to tissues, i.e., as concentration of O₂ falls in a given region of tissue the transport of O₂ to that region is spontaneously increased by this mechanism.

In conclusion it should be noted that the theory described here 
must be modified to include the reaction rate of gas with hemo- 
globin if the gas concentration in the solution changes rapidly 


with time. 


Notes added in proof: A brief summary of this work appeared in
Science in May of this year (Collins, 1961). Subsequently the
author received many private communications, some of which
should be mentioned. One from Dr. J. B. Wittenberg pointed out
that he had proposed augmentation of O₂ transport in tissues by
myoglobin in his paper ‘Oxygen transport—a new function pro- 
posed for myoglobin,”’ Biological Bulletin, 117, No. 2, 402-408, 
October, 1959. This antedates the work of Scholander and Hemmingsen. A communication from Dr. I. S. Langmuir pointed out that in the red blood cell the hemoglobin molecules are tightly packed and self-diffusion will not occur (Perutz, Nature, 161, 204). However, he suggests, as did Dr. Wittenberg, that myoglobin in muscles or other tissues should enhance O₂ transport as described by the theory given here (see "The Measurement of the Diffusion of Oxygen through Respiring Tissue" by I. S. Langmuir and Ann Bourke, Biochemistry Journal, 1960, 76, 225).

Another communication, from Dr. Norman C. Staub, University of 
California, San Francisco Medical Center, suggested that rota- 
tional rather than translational motion of the hemoglobin molecules 
produced the augmentation of O₂ transport. This might be the case, or a combination of both motions could occur. However, the only effect in our equations would be a modification of the value of $D_{Hb}$.


With a view to future applications in relational biology, the notion of relations between sets is introduced and several theorems are demonstrated.


In relational biology we frequently find situations in which sets 
of biological subproperties (Rashevsky, 1960) stand in the same 
relation to each other as do the corresponding basic sets, or even 
as some other kinds of sets. For example, it is legitimate to say 
that the eye of a higher organism, an organ composed of a set of 
such diverse elements as sensory cells, muscle cells, secretory 
cells, etc., stands in the same basic relation to the locomotory 
apparatus of the organism as the sensitive spot of a unicellular 
organism to the locomotory mechanism of the latter. 

Or, speaking in terms of properties rather than anatomic car- 
riers, we may say again that the biological property of the eye 
stands in a certain relation to the locomotory property of the whole 
organism. But both properties are sets of numerous subproperties 
(Rashevsky, 1960). 

Such situations occur, as a matter of fact, quite frequently. Yet, 
if we try to formulate them mathematically in order to deduce any 
possible conclusions, we are confronted with a lack of the notion 
of relations between sets in mathematics. Inasmuch as situations, 
such as those mentioned above, are more than likely to be of im- 
portance biologically, it is desirable to develop the appropriate 
mathematical tool to deal with them. This note is meant as the 
very first step towards such a development. Since we are starting 
from scratch and since it is the first step, no immediate application 
should be expected. But any further development along these lines 
will probably have to follow this or a similar approach. 

We shall now give a few definitions and elementary theorems.

Let A, B, C, ... be finite sets. Let a, b, c, ... be elements. Let xRy be any binary relation, with x in its domain, y in its range.

Definition 1. We say that a strong relation R exists between A and B, and write ARB, if and only if, for every pair of elements a, b such that a ∈ A, b ∈ B, we have aRb.

Definition 2. We say that a weak relation R exists between A and B, and write AR'B, if and only if (1) for every element a ∈ A there exists an element b ∈ B such that aRb; (2) for every element b' ∈ B there exists an element a' ∈ A such that a'Rb'; (3) there exists at least one pair a'' ∈ A, b'' ∈ B such that the relation a''Rb'' does not hold.
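For finite sets the two definitions translate directly into executable checks. The sketch below is an illustration only: the function names `strong` and `weak`, the representation of R as a two-argument Python predicate, and the divisibility relation used in the example are assumptions, not part of the note.

```python
from itertools import product


def strong(A, B, R):
    """Definition 1: aRb holds for every pair a in A, b in B."""
    return all(R(a, b) for a, b in product(A, B))


def weak(A, B, R):
    """Definition 2: every a relates to some b, every b is related to by
    some a, and at least one pair fails to be related."""
    every_a_has_b = all(any(R(a, b) for b in B) for a in A)
    every_b_has_a = all(any(R(a, b) for a in A) for b in B)
    some_pair_fails = any(not R(a, b) for a, b in product(A, B))
    return every_a_has_b and every_b_has_a and some_pair_fails


def divides(a, b):
    """Example relation: a divides b."""
    return b % a == 0


if __name__ == "__main__":
    print(strong({1}, {2, 3}, divides))      # 1 divides everything
    print(weak({2, 3}, {4, 9}, divides))     # 2|4 and 3|9, but 2 does not divide 9
```

Condition (3) makes the two notions mutually exclusive: a pair of sets in strong relation is never in weak relation.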

Theorem 1. If ARB, and if R is not reflexive, then A ∩ B = ∅.

Proof. Consider an element a of the set A ∩ B. It is by definition an element of A; it is also an element of B. Hence, from definition 1 it follows that aRa. This contradicts the assumption that R is not reflexive, understood here to mean that aRa holds for no element a; hence A ∩ B can contain no element. Q.E.D.

Theorem 2. If AR'B and if A ∩ B ≠ ∅, then there exists at least one pair of elements a, a' ∈ A such that aRa', and at least one pair of elements b, b' ∈ B such that bRb'.

Proof. Let A ∩ B contain only one element a. Then a ∈ A. By definition 2 there exists an element b ∈ B such that aRb. But a is also in B, hence a and b are a pair of elements in B such that aRb.

Since a ∈ B, by definition 2 there exists an element a' ∈ A such that a'Ra. But a is also in A. Hence a', a is a pair of elements in A such that a'Ra.

If A ∩ B contains more than one element we apply the same reasoning to each of them and establish thus that there is more than one pair of elements a, a' ∈ A and more than one pair of elements b, b' ∈ B such that aRa', bRb'. Q.E.D.

Theorem 3. If ARC and BRC, then (A ∪ B)RC.

Proof. For each pair α ∈ A ∪ B and c ∈ C, we have αRc, because α belongs to either A or B, or both.

Theorem 4. If AR'C and BR'C, then (A ∪ B)R'C.

Proof. By definition 2, AR'C implies that for each element a ∈ A there exists an element c ∈ C such that aRc. Similarly, BR'C implies that for each element b ∈ B there exists c ∈ C such that bRc. Hence for every element α of A ∪ B there exists a c ∈ C such that αRc. Thus the first part of definition 2 is satisfied for A ∪ B and C.

Conversely, for each c ∈ C there is an a ∈ A and a b ∈ B such that aRc and bRc. Hence for each c ∈ C there exists an α ∈ A ∪ B such that αRc. Thus the second part of definition 2 is satisfied for A ∪ B.

Finally, from definition 2 and from AR'C, BR'C it follows that there is at least one pair a' ∈ A, c' ∈ C of elements such that the relation a'Rc' does not hold. But a' ∈ A ∪ B. Hence there is at least one pair α' ∈ A ∪ B, c' ∈ C such that α'Rc' does not hold. Thus the third part of definition 2 is satisfied for A ∪ B and C. Hence (A ∪ B)R'C. Q.E.D.

Theorem 5. If ARC and BRC, and if A ∩ B ≠ ∅, then (A ∩ B)RC.

Proof. Every element of A ∩ B is an element of both A and B. Therefore for any α ∈ A ∩ B and c ∈ C we have αRc. Q.E.D.

A similar theorem does not hold for weak relations. If AR'C, BR'C and A ∩ B ≠ ∅, then for every α ∈ A ∩ B there is a c ∈ C such that αRc, because α is both in A and in B. But there may exist elements c_i ∈ C of a kind that there is no α ∈ A ∩ B such that αRc_i. True, there must exist for each c_i at least one a ∈ A and b ∈ B such that aRc_i, bRc_i. But both a and b may be outside of A ∩ B.
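A concrete finite example makes the failure for intersections visible. The sketch below is an illustration with assumed names: `weak` restates Definition 2 as a Python predicate, R is taken to be divisibility, and the three sets are chosen by hand so that the union inherits the weak relation (Theorem 4) while the intersection does not.

```python
from itertools import product


def weak(A, B, R):
    """Definition 2 for finite sets, with R given as a two-argument predicate."""
    every_a_has_b = all(any(R(a, b) for b in B) for a in A)
    every_b_has_a = all(any(R(a, b) for a in A) for b in B)
    some_pair_fails = any(not R(a, b) for a, b in product(A, B))
    return every_a_has_b and every_b_has_a and some_pair_fails


def divides(a, b):
    """Example relation: a divides b."""
    return b % a == 0


if __name__ == "__main__":
    A, B, C = {2, 3}, {2, 5}, {4, 15}
    print("AR'C:      ", weak(A, C, divides))
    print("BR'C:      ", weak(B, C, divides))
    print("(A u B)R'C:", weak(A | B, C, divides))
    print("(A n B)R'C:", weak(A & B, C, divides))
```

Here the element 15 of C is reached from A only through 3 and from B only through 5, both of which lie outside A ∩ B = {2}, which is exactly the situation described in the remark above.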

Theorem 6. If ARB, then for any proper subset A' of A we have A'RB.

Proof. For every a' ∈ A', we also have a' ∈ A. Hence for every pair a' ∈ A', b ∈ B, we have a'Rb. Q.E.D.

A corresponding theorem does not hold for weak relations. For every a ∈ A' there is always a c ∈ C such that aRc. But it is possible to have a c_i ∈ C such that the corresponding a ∈ A for which aRc_i is in A − A'.

Theorem 7. If R is transitive, then ARB and BRC implies ARC.

Proof. ARB means that for every pair a ∈ A, b ∈ B we have aRb, while BRC implies that for every pair b ∈ B, c ∈ C, we have bRc. This means that for every triplet a ∈ A, b ∈ B, c ∈ C, we have aRb and bRc. Because of transitivity of R, we have for every a, c, aRc.


This work was aided by United States Public Health Service
Grant RG-5181.


The tremendous complexity of even the simplest living unit makes a
correct theoretical guess as to its mechanism very difficult. It is there- 
fore suggested that, following the example of the physical sciences, a 
number of purely abstract cases in molecular biology be studied mathe- 
matically at first. Subsequent comparison of the different conclusions of 
such an abstract theory with available data would enable us to decide 
which of the conceivable situations are likely to occur in reality. 

As a first step toward such a study the problem of the minimal size of 
a living unit is studied. Usually the minimal size is considered to be 
determined by information-theoretical requirements. It is shown that the 
minimal size may have a very different origin. It may be determined by 
the possibility that too small a system, even though performing all 
necessary biological functions, may not be viable unless it is a member 
of a large group of other similar systems. This approach is developed 
both from a metric and from a relational point of view. Some relational 
characteristics of systems of reactions which possess the elementary 
metabolic properties of organisms are studied. 


I. General Considerations on Mathematical Biology. While
hitherto mathematical biology has been developing almost exclusively on the "classical" basis, using physical considerations which apply to "macroscopic" systems, the need for the consideration of life phenomena on the molecular level has become
quite apparent. There is a wealth of experimental evidence to the 
effect that ‘‘life begins on a molecular level.’’ There is, however, 
no clue whatsoever toward even a remote understanding of the 
basic principles involved. 

There is overwhelming evidence that DNA plays an important 
role as a basic chemical unit of life. The ingenious Watson-Crick 
model of the DNA molecule and of the mode of its replication led 
a number of scientists to assume, explicitly or implicitly, that the
DNA molecule may be or is the living molecule. Even more
frequently is the DNA molecule identified with the gene which itself
is considered the living molecule. 

Criticism of the very concept of a living and self-replicating 
molecule has not been lacking (Oparin, 1957). Also the identifi- 
cation of the DNA molecule with the gene has been challenged 
(Chayen, 1959). Yet, even if these criticisms are not accepted, 
the sad fact remains that the Watson-Crick model does not really 
explain anything, in the sense that explanation is meant in physics. 

Strong evidence that the replication of the DNA molecule
proceeds according to the Watson-Crick picture is given by
experiments of J. H. Taylor et al. (1957). Those experiments, however,
have also been challenged by the work of L. F. La Cour and S. R.
Pelc (1958). Similarly, the phenomenon of transduction seems to
indicate clearly that DNA is the bearer of primary genetic infor- 
mation. Yet, as R. Rosen (1960) has shown, the interpretation of 
DNA as merely the observing instrument that reads the primary 
information is also in agreement with the phenomenon of transduction. 

Assuming, for the sake of argument, that the prevailing views on 
the DNA molecule are correct, we still are left with a large number 
of unanswered questions. If DNA replicates itself by the separa- 
tion of the two strands and a subsequent building up of the new 
strands, then the question immediately arises as to why the strands 
separate in the first instance. If separate strands represent a 
more stable configuration than the joint double strand, then why 
do the less stable double strands reform after separation? No 
specific explanation of that has been offered, except the rather 
vague general idea that the DNA molecule initiates the synthesis 
of proteins which in their turn initiate the syntheses of other 
products that form the environment of the DNA. As a result of 
this chain of reactions there may be changes in the environment 
that change the conditions of stability of DNA. This may result 
in, firstly, the separation of the two strands, and later in the re- 
formation of new double-stranded molecules. But this sort of 
vague reasoning can hardly be considered as an explanation, in 
the sense, for example, that diffusion drag forces may tentatively 
be considered an explanation of cell division. True enough, this 
latter explanation is inadequate. But it offers a specific picture 
of a mechanism and, what is more important perhaps, it has
predictive value. The same may be said about various mathematical
models of the central nervous system which explain quantitatively
a large number of phenomena.

Moreover, all the points of view which regard the DNA molecule 
either as something close to the fundamental living unit or even 
as that fundamental unit itself seem to completely disregard the 
fact that self-replication is not the only characteristic of life. If 
things were that simple, then the old-fashioned analogies between 
self-reproduction of crystals and life might have been taken more 
seriously than we can take them. Self-replication is one of the 
important characteristics of life. It is a conditio sine
qua non of life. But there are other aspects of life which form
equally important conditiones sine qua non. As we have pointed out
in several of our writings (for a summary, see Rashevsky, 1960b), 
a characteristic property of an organism, whether macroscopic or 
submicroscopic, is the process of selection from the environment. 
The finding of food, its digestion, the eventual assimilation of 
digested parts and rejection of undigested ones are all logical 
subsets of the set of selective phenomena. Even locomotion to- 
wards food represents a process of selection. 

Another important characteristic of the organism is its adapta- 
bility. An organism, as a rule, does not confine its food only to 
one very specific type. The greater the adaptability, the greater 
the probability of survival. 

Any organism, no matter how large or how small, must possess 
those properties. Viruses, which are the smallest known organ- 
isms, do possess them (Rashevsky, 1954, 1960b). Therefore, if 
we wish to develop a theory of the simplest organisms, the theo- 
retical picture must include and explain all these basic properties 
in some specific manner. This requirement imposes certain re- 
lational restrictions, or relational characteristics, even upon the 
simplest organism (Rashevsky, 1959a). If there exists such a 
thing as a living molecule, then that living molecule is subject to 
the same relational restrictions. 

It is important that the essential conditions imposed upon any 
living organism are of a relational rather than of a quantitative 
character. Organisms manifest themselves only to the extent that
they represent physical systems possessing various physical 
characteristics, yet the mathematical properties that are common 
to all organisms are of a relational character (Rashevsky, 1960b, 
chap. xxviii). Physics abounds with universal constants, such as
the electronic charge, the mass of electron, the mass of proton,
Boltzmann’s constant, Planck’s constant, etc. There are no uni- 
versal constants in biology. There are, as a matter of fact, no 
constants at all in the sense that the physicist understands the 
word. And this absence of constants is not due to lack of precision 
measurements on biological objects. It is inherent in the nature 
of biology. Instead of numerical constants, biological phenomena 
are characterized by some constant relations which recur in all 
organisms. 

All the above, together with arguments advocated previously 
(Rashevsky, 1959b), strongly suggests that the approach to mo- 
lecular biology should be from the relational rather than from the 
quantitative point of view. An attempt by us in that direction indi- 
cates the possibility of a theory of molecular systems that do 
possess the properties characteristic of organisms (Rashevsky, 
1959b). A different attempt in a somewhat different direction has 
been made by Robert Rosen (1960) in his study of quantum- 
mechanical aspects of genetics. 

If we are to discuss biological phenomena on the molecular 
level, then introduction of quantum mechanics seems to be un- 
avoidable. As we pointed out elsewhere (Rashevsky, 1959b) it is 
interesting that, of all branches of modern physics, quantum me- 
chanics shows more relational aspects than others. Yet it may 
happen that quantum mechanics will be found insufficient for the 
explanation of biological phenomena. In that case an extension 
and further generalization of quantum mechanics may be indicated. 
As we shall see, however, a different approach to the problem may 
be found. We thus have before us in molecular biology a wealth of 
experimental facts, at times contradictory and inconclusive, and 
practically no specific framework for the understanding of those 
facts. The situation is not unlike the one that existed in general 
macroscopic biology some 30 or 40 years ago, before the present 
mathematical biology was developed. At that time we felt that 
progress towards an understanding of biological phenomena would 
be made only through a preliminary, purely abstract study of 
various conceivable possibilities, rather than through attempts at 
building theories about isolated specific phenomena without
adequate general theoretical background. In 1927 we wrote: “If we
are to find eventually the mechanism or mechanisms responsible
for cell division we should first investigate as many as possible
conceivable mechanisms in abstracto. A comparison of the
conclusions drawn from such a theoretical study with observed facts
will then indicate which of the theoretically possible mechanisms
are actually realized in nature.” We still endorse this thinking in
regard to the explanation of any phenomenon in biology. An ex- 
planation of a specific phenomenon without a general theoretical 
background seems at least to be made more difficult. An abstract 
theoretical background has always been provided in the physical 
sciences. Frequently this background has been on a purely mathe- 
matical level. Without the purely abstract studies of Gauss, Rie- 
mann, and Lobachevsky on non-Euclidian geometries, and without 
the subsequent work of Ricci and Levi-Civita on tensor analysis, 
the work of Einstein would have been impossible. Our universe 
turned out to be non-Euclidian. But if a general abstract theory of 
non-Euclidian spaces were not available, how could even a genius 
like Einstein have discovered the non-Euclidian character of our 
universe? 

In this paper we propose to take a few first steps towards the 
development of an abstract mathematical molecular biology. The 
very first steps in that direction may be considered perhaps as 
represented by the above mentioned work of N. Rashevsky (1959b) 
and Robert Rosen (1960). In line with what we stated above, the 
most important aspects to be studied in our approach may be the 
relational ones. However, the quantitative aspects should by no 
means be neglected. Quantitative and relational biology do not 
exclude but rather supplement each other. 

II. The Minimum Size of a Living Unit. Whether such a thing as
a “living molecule” exists or not, it appears that life is connected
to the existence of very large molecules. This is usually ex- 
plained by the complexity of the functions which those molecules 
must perform. In the last decade the point of view gained as- 
cendency that the minimal size of a living unit is determined by 
the amount of information it must store in order to be able to re- 
produce another unit like itself. This view again puts emphasis 
on self-reproduction. As our recent studies (Rashevsky, 1960c) 
show, there are a number of difficulties connected with the above 
idea. Not the least one is that much smaller molecules could 
replicate themselves on the basis of information content only. 

In our above mentioned work on a possible quantum-mechanical 
relational interpretation of self-replication, we tried to explain the
very large size of the self-replicating unit by the requirement of a
sufficiently large number of energy levels, with the transition be- 
tween them assumed to be responsible for the process of self- 
replication. Since those levels correspond in our theory to dif- 
ferent possible modes of breakdown of a molecule, the latter must 
consist of a very large number of atoms. 

This explanation is, however, rather artificial because quite a 
sufficient number of levels, in fact a countably infinite one, is 
provided by the electronic levels of even a simple molecule. If a 
molecule dissociates or breaks down into components, starting 
with different states of electronic excitations, the energies in- 
volved in the breakdown will differ. We thus may well have, even 
in a small molecule, enough different transition energy values to 
satisfy the condition (19) of Rashevsky, 1959b. 

If we abandon the preconceived idea that information content for 
self-replication is the determining factor of the size of a living 
unit, we find other possible factors. Almost three decades ago 
(Rashevsky, 1931) we suggested a simple mechanism by virtue of 
which a liquid system, or drop, could grow once it is above a 
critical size but could not exist below that size. The considera- 
tions were based on a physically macroscopic picture, considering 
‘‘matter in bulk.’’ It is, however, possible to translate those 
ideas into ‘‘molecular’’ language. Let us first recapitulate here 
those ideas. 

Consider a spherical liquid system with radius $r_0$, consisting of
a substance $S$, or of a set $S$ of different substances. Let $S$ be
formed from components $A_i$ which are present in the surrounding
medium and which can freely diffuse into the system. Let the
formation of $S$ occur at a constant rate $q$ whenever all the
components $A_i$ are present. Let, however, other substances $B_i$ be present
in the surrounding medium. Let the $B_i$ not diffuse into the system; in
other words, let $S$ be impermeable to the $B_i$'s. Let, however, the
$B_i$'s react with $S$ at the surface of the system, causing the
destruction of $S$ at a constant rate $q_0$.

Under these conditions the mass of the system will increase at
a rate proportional to the volume $(4/3)\pi r_0^3$ of the system, and
decrease at a rate proportional to the surface $4\pi r_0^2$. If 7 is the
density of the system $S$, then we have

Hence a system with $r < r_0$ will gradually disappear, while a
system with $r > r_0$ will grow and, if proper conditions exist, may for
example multiply by the action of diffusion drag forces. Since the
formation of a system with a finite radius $r_0$ must begin with the
formation of smaller systems, such a system cannot
be formed spontaneously. New systems of that type can be
produced only by fission of already existing systems.
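
The balance between growth in the bulk and destruction at the surface can be illustrated numerically. The sketch below is one possible reading, not the author's own equations: it assumes that $q$ is the mass formed per unit volume per unit time and $q_0$ the mass destroyed per unit surface per unit time, so that the net rate vanishes at a radius $3q_0/q$. The function names and sample values are hypothetical.

```python
import math


def net_mass_rate(r, q, q0):
    """Net rate of change of mass of a sphere of radius r.

    Assumption (not from the source): q is mass formed per unit volume
    per unit time, q0 is mass destroyed per unit surface per unit time.
    """
    production = q * (4.0 / 3.0) * math.pi * r ** 3
    destruction = q0 * 4.0 * math.pi * r ** 2
    return production - destruction


def critical_radius(q, q0):
    """Radius at which production and destruction balance: r = 3*q0/q."""
    return 3.0 * q0 / q


if __name__ == "__main__":
    q, q0 = 2.0, 0.5  # illustrative values only
    rc = critical_radius(q, q0)
    for r in (0.5 * rc, rc, 2.0 * rc):
        print(f"r = {r:.3f}, net rate = {net_mass_rate(r, q, q0):+.4f}")
```

Under these assumptions the net rate is negative below the balancing radius and positive above it, which is the qualitative behaviour the text describes.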

Thus the existence of a minimum size of the system is con- 
nected with the impossibility of a ‘‘spontaneous generation’’ of 
the system. Both are due to the circumstance that the sufficient 
size ‘‘protects’’ the system from the destructive effect of the 
surroundings. 

The above model is of course unrealistic and serves only as an 
illustration. We shall now apply an essentially similar idea to a 
set of hypothetical molecules which possess the basic properties 
of the living and which are not each restricted to a minimum size. 
We shall also generalize the picture by considering that the de- 
struction of each such molecule occurs spontaneously and is not 
affected by the external medium. 

Basically any living system, and therefore also our hypothetical
molecule, can be considered as what Rosen calls an $(\mathfrak{M},\mathfrak{R})$-system
(Rosen, 1958; Rashevsky, 1960c, 1961). As we have seen
(Rashevsky, 1959a), the minimum relational requirements for a living
system lead to an $(\mathfrak{M},\mathfrak{R})$-system. Therefore we may consider that
our hypothetical living molecule is an $(\mathfrak{M},\mathfrak{R})$-system.

Due to catabolic processes the $M$-components of an $(\mathfrak{M},\mathfrak{R})$-system
eventually cease to exist, “die out.” They are, however,
continuously repaired and reestablished by the $R$-components. Rosen
has shown that for purely topological reasons it is impossible for
every component of an $(\mathfrak{M},\mathfrak{R})$-system to be reestablishable. An
$(\mathfrak{M},\mathfrak{R})$-system may contain components, called “central,” the loss of
which stops the operation of the whole system.

Rosen’s theorem does not imply that some components will
eventually “die” because of catabolic processes. In a case of
slow wear and tear, every $M$-component of an $(\mathfrak{M},\mathfrak{R})$-system will be
maintained by its corresponding $R$-component. But if the failure
of an $M$-component occurs relatively suddenly because of an actual
injury, then not every suddenly failing component is
reestablishable. In small $(\mathfrak{M},\mathfrak{R})$-systems, sudden failures may be expected to
occur because of inherent molecular statistical fluctuations,
somewhat in the manner in which some self-circuited neuronic cycles
will sometimes “naturally” fail (Rashevsky, 1960b, p. 15).

If a central component is also non-reestablishable, then either
it must be of an essentially different nature from the other
components in the sense that it is not subject to accidental
destruction, or else the whole $(\mathfrak{M},\mathfrak{R})$-system can exist but a limited time.

Consider the latter case. In particular consider a “molecular”
$(\mathfrak{M},\mathfrak{R})$-system that has one central, non-reestablishable component,
$M_0$. When, because of some accident which can happen to any
complex system, $M_0$ is destroyed, the whole $(\mathfrak{M},\mathfrak{R})$-system ceases
to operate.

Let, however, the output of one component or the combined
outputs of several components of the $(\mathfrak{M},\mathfrak{R})$-system result in the
reproduction of the whole $(\mathfrak{M},\mathfrak{R})$-system. Let the time it takes for an
$(\mathfrak{M},\mathfrak{R})$-system to reproduce itself be $\tau$. If the average life of the
central component $M_0$ is much shorter than $\tau$, then such an
$(\mathfrak{M},\mathfrak{R})$-system, though capable of self-reproduction, will not reproduce.
Before it can reproduce, it will “die” due to the failure of $M_0$.
Thus such a system would be incapable of existence.

Consider, however, a large number $N$ of such $(\mathfrak{M},\mathfrak{R})$-systems.
Let them be “coupled” in the following manner. Let it be
possible for the component $M_0$ of each system to provide inputs not
only into the set $\mathfrak{S}_0$ of components of its own $(\mathfrak{M},\mathfrak{R})$-system, but
into the corresponding sets of other $(\mathfrak{M},\mathfrak{R})$-systems. In such a case
if the component $M_0$ of a given $(\mathfrak{M},\mathfrak{R})$-system fails, the system will
remain functional as long as there are any other functional
$(\mathfrak{M},\mathfrak{R})$-systems coupled with it.

Let the probability of failure of an $M_0$-component during the
time interval $dt$ be $a\,dt$. Then, if there are $N$ $(\mathfrak{M},\mathfrak{R})$-systems, and
hence $N$ $M_0$-components present, the $M_0$-components will decay
according to the equation

$$\frac{dN}{dt} = -aN. \tag{4}$$

Denoting by $N_0$ the number of functional $(\mathfrak{M},\mathfrak{R})$-systems at $t = 0$
and by $N(t)$ their number at any other $t$, we have

$$N = N_0 e^{-at}. \tag{5}$$

As long as at least one $M_0$-component remains functional, the
set of $N_0$ $(\mathfrak{M},\mathfrak{R})$-systems, which are functional at $t = 0$, will remain
functional. The time $t_1$ it takes for all components but one to decay
is given by

$$t_1 = \frac{1}{a} \ln N_0. \tag{6}$$

Because of the statistical nature of the process of failure,
equation (6) is not meant to give exactly the time at which only
one $M_0$-component will remain. It gives only the order of
magnitude. However, if

$$\tau < t_1, \tag{7}$$

then before all the $M_0$-components have failed, new $(\mathfrak{M},\mathfrak{R})$-systems
with new $(\mathfrak{M},\mathfrak{R})$-components will have appeared because of
self-reproduction.

Introducing (6) into (7) we find

$$N_0 > e^{a\tau}. \tag{8}$$
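
The criterion can be checked numerically. The following sketch assumes the exponential decay law $N = N_0 e^{-at}$ and takes the order-of-magnitude time for all but one $M_0$-component to fail as $\ln(N_0)/a$; comparing that time with the reproduction time $\tau$ is then equivalent to asking whether $N_0 > e^{a\tau}$. The function names and sample values are illustrative assumptions, not taken from the source.

```python
import math


def time_until_one_left(n0, a):
    """Order-of-magnitude time at which n0*exp(-a*t) has fallen to 1."""
    return math.log(n0) / a


def group_can_persist(n0, a, tau):
    """True if the reproduction time tau is shorter than that time."""
    return tau < time_until_one_left(n0, a)


if __name__ == "__main__":
    a, tau = 1.0, 10.0  # illustrative: reproduction takes 10 mean lives
    for n0 in (10, 10**4, 10**5):
        print(n0, round(time_until_one_left(n0, a), 2),
              group_can_persist(n0, a, tau))
```

Because the decay time grows only logarithmically with the group size, a tenfold increase in the number of coupled systems buys a fixed additional time of $\ln(10)/a$.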

If $a\,dt$ is the probability of an $M_0$-component failing during the
interval of time $dt$, then $1/a$ represents the average life-span of
an $M_0$-component. This is readily seen by the following
consideration. Instead of the infinitesimal time interval $dt$, consider
a small but finite interval $\Delta t$. The probability that $M_0$ will fail at
the time between $t$ and $t + \Delta t$ is equal to the probability that it
will not fail at any of the $t/\Delta t$ intervals preceding $t$ and will fail
at $t + \Delta t$. The probability of not failing in the interval $\Delta t$ is equal
to

$$1 - a\,\Delta t. \tag{9}$$

Hence the probability of not failing prior to $t$ is

$$(1 - a\,\Delta t)^{t/\Delta t}, \tag{10}$$

and the probability of failing exactly between $t$ and $t + \Delta t$ is

$$p(t)\,\Delta t = a\,\Delta t\,(1 - a\,\Delta t)^{t/\Delta t}. \tag{11}$$

As $\Delta t \to 0$, $\Delta t = dt$, expression (11) tends to

$$p(t)\,dt = a e^{-at}\,dt. \tag{12}$$

Expression (12) gives the probability of $M_0$ living to an age
between $t$ and $t + \Delta t$. Hence the average age $T_0$ of an $M_0$ is

$$T_0 = \int_0^\infty t\,p(t)\,dt = \frac{1}{a}. \tag{13}$$

In view of (13), inequality (8) may be written

$$N_0 > e^{\tau/T_0}. \tag{14}$$

It is plausible that the life spans of all components are of the
same order of magnitude. If $n$ is the number of components of the
$(\mathfrak{M},\mathfrak{R})$-system, then $\tau$, the time necessary to produce all the
components of the replica, will be of the order of $nT_0$, if an isolated
$(\mathfrak{M},\mathfrak{R})$-system should be able to exist even for a short time. The
number $n$ is likely to be about 10 to 15. Hence $N_0 > 10^4$ to $10^6$.
This is in general agreement with what is considered the smallest
number of units of which a “living molecule” is built.
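
As a numerical check on the two results used here, the sketch below sums the discrete failure probabilities $a\,\Delta t\,(1 - a\,\Delta t)^{t/\Delta t}$ to confirm that the mean life approaches $1/a$, and then evaluates $e^{\tau/T_0}$ for $\tau/T_0 = n$ equal to 10 and 15. The function names, step size, and cutoff are assumptions chosen for illustration.

```python
import math


def mean_life_discrete(a, dt, n_steps):
    """Mean failure time from the discrete probabilities a*dt*(1 - a*dt)**k,
    where k = t/dt is the number of intervals survived before failing."""
    survive = 1.0 - a * dt
    total = 0.0
    p_alive = 1.0  # probability of having survived k intervals
    for k in range(n_steps):
        total += (k * dt) * (a * dt) * p_alive
        p_alive *= survive
    return total


def min_group_size(n):
    """Lower bound e**n on N0 when tau is about n mean life spans."""
    return math.exp(n)


if __name__ == "__main__":
    a = 0.5
    print(mean_life_discrete(a, dt=1e-3, n_steps=100_000), 1.0 / a)
    for n in (10, 15):
        print(n, f"{min_group_size(n):.3g}")
```

The bound grows exponentially with $n$, so the estimate of the minimal group size is very sensitive to the assumed number of components.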

We see that the minimal size is determined in this case by the
interaction of a number of $(\mathfrak{M},\mathfrak{R})$-systems. Relationally the
situation is analogous to the interaction between individuals in a
society. An individual living alone has a lesser chance of sur- 
vival than a large group of individuals. When a lone individual is, 
for example, ill and unable to take care of his needs, he may die 
before he produces a progeny. In a society, others may take care 
of him during the illness and the probability of survival is increased. 

We have pointed out elsewhere (Rashevsky, 1954, 1960b) that 
the principle of biotopological mapping, or, as we prefer to call it 
now, the principle of biological epimorphism, may be extended to 
organized biological units of higher order. The possibility of a 
relational mapping of societies on individual organisms gives the 
answer to the long observed similarity between societies and 
organisms. In the present instance we have a mapping of a set of 
social interactions upon one of the simplest conceivable sets of 
biological interactions. 

Instead of the somewhat abstract picture of an $(\mathfrak{M},\mathfrak{R})$-system, we
may visualize the situation studied here by considering a reaction
chain of a definite structure. Let us, for example, take the
following system of reactions:

$$\begin{array}{ccccccc} & \searrow & & \searrow & & \searrow & \\ A_1 & \longrightarrow & A_2 & \longrightarrow & \cdots & \longrightarrow & A_n \end{array} \tag{15}$$

$$A_1 + A_n \longrightarrow 2A_1. \tag{16}$$

Here the slanted arrows indicate that each component $A_i$ reacts
with some molecules of the environment to give $A_{i+1}$.

Those reactions may be considered either from the ordinary 
chemical point of view or as a chain of quantum transitions from 
different states, as mentioned at the end of a preceding paper 
(Rashevsky, 1959b). In either case it is essential that the life
span of the intermediate product $A_1$ be long enough so that (16)
can take place. If $A_1$ decays before $A_n$ is formed, $A_1$ will not be
duplicated. Thus, if the life span $T_0$ of $A_1$ is less than the
duration $\tau$ of the chain of reactions (15), neither (15) nor (16) will
occur. 

Consider again a set $\mathfrak{C}$ of such reactions as (15) and (16).
During the part of the process between $A_1$ and $A_n$, the presence of
$A_1$ is not essential for that particular reaction of the set $\mathfrak{C}$. All
that is necessary is that at the time when $A_n$ in a particular
element of the set $\mathfrak{C}$ is formed, an $A_1$ be present in some element
of the set $\mathfrak{C}$. Assume for simplicity that the duration of each step
of (15) is the same. This duration is then equal to $\tau/n$. Let the
set $\mathfrak{C}$ contain $N_0$ elements, such that the initial moments of each
element are distributed uniformly. Consider a particular element
of $\mathfrak{C}$ at the time $t$ when $A_n$ is just formed. The number of
other elements of $\mathfrak{C}$ in which the component $A_1$ is produced
within the interval $t \pm \tau/n$ is of the order of $N_0/n$. To insure that at any time
when an $A_n$ is produced, there will be always a sufficient number
of $A_1$’s to enable reaction (16) to proceed, requires that

$$N_0 \gg n. \tag{17}$$


Again we see that in order for a chain of reactions (15) and (16)
to “survive,” a sufficiently large number of such reactions must
take place simultaneously with a “coupling” between the different
reactions. If $n \sim 10$, then $N_0 \sim 100$ will already be sufficient.
But each reactant $A_i$ is likely to be composed of about 10 atoms,
so that the total number of atoms present at one time during the
reaction chain (15) is about 100. The total number of atoms
present in the whole set $\mathfrak{C}$ will again be about $10^4$.

The whole set $\mathfrak{C}$ represents an association not of static systems
but of dynamic systems of reactions. 

The association of a large number of systems may also enable 
the systems to survive if each of them produces a substance which 
forms an envelope that shields the system from some harmful 
effect of the environment.

The necessity of having a sufficiently large number of systems
in order to secure “survival” may also be due to a different
cause. Instead of disintegrating spontaneously, the systems may 
disintegrate as a result of reaction with some substances present 
in the environment. As part of their metabolic activities, each 
system may produce a type of molecule which is capable of form- 
ing a protective coating that shields the system from deleterious 
effects of the environment and yet is permeable to other sub- 
stances present in the environment and necessary for the normal 
functioning of the system. 

Consider, for example, $N_0$ systems clustered in a spheroidal
aggregate of radius $r_0$. Let the volume of each system be $v$. Then
approximately

$$\frac{4}{3}\pi r_0^3 = N_0 v. \tag{18}$$

If the protective coating must be of thickness $\Delta$, which for
simplicity we assume to be much smaller than $r_0$, then the volume of the
coating must be

$$v_c = 4\pi r_0^2 \Delta, \tag{19}$$

and its mass $m_c$ is, denoting by $\delta$ its density,

$$m_c = 4\pi \Delta \delta r_0^2. \tag{20}$$

Let the protective substance be produced at a constant rate $q$ by
each system, and disintegrate at a rate $k m_c$, where $k$ is a constant.
Then

$$\frac{dm_c}{dt} = N_0 q - k m_c, \tag{21}$$

and in the steady state

$$m_c = \frac{N_0 q}{k}. \tag{22}$$

From (18) we have

$$r_0 = \left(\frac{3 N_0 v}{4\pi}\right)^{1/3}. \tag{23}$$

Introducing (23) into (20), and remembering that the right side of
(22) must be equal to or greater than the right side of (20), we find:

$$N_0 \geq 36\pi v^2 \left(\frac{\Delta \delta k}{q}\right)^3. \tag{24}$$
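
The threshold can be checked with a short calculation. The sketch below assumes the relations of this passage, namely $\tfrac{4}{3}\pi r_0^3 = N_0 v$, a required coating mass $4\pi \Delta \delta r_0^2$, and a steady-state mass $N_0 q/k$; eliminating $r_0$ gives $N_0 \geq 36\pi v^2 (\Delta \delta k/q)^3$. The function and parameter names are hypothetical, and the closed form is the result of that elimination rather than a quotation.

```python
import math


def required_coating_mass(n0, v, thickness, density):
    """Mass of a thin shell around a spherical aggregate of n0 systems."""
    r0 = (3.0 * n0 * v / (4.0 * math.pi)) ** (1.0 / 3.0)
    return 4.0 * math.pi * thickness * density * r0 ** 2


def steady_state_mass(n0, q, k):
    """Steady state of dm/dt = n0*q - k*m."""
    return n0 * q / k


def min_systems_sphere(v, thickness, density, q, k):
    """Smallest n0 for which the steady-state mass covers the required mass."""
    return 36.0 * math.pi * v ** 2 * (thickness * density * k / q) ** 3


if __name__ == "__main__":
    params = dict(v=1.0, thickness=0.1, density=2.0)  # illustrative values
    q, k = 0.05, 1.0
    n_min = min_systems_sphere(q=q, k=k, **params)
    print(n_min, required_coating_mass(n_min, **params),
          steady_state_mass(n_min, q, k))
```

Production scales with $N_0$ while the required mass scales with $N_0^{2/3}$, which is why a sufficiently large aggregate always crosses the threshold in the spherical case.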


The arrangements of the systems may be different from spherical. 
But in general the area of the protective surface per system will 
increase less rapidly than the number of systems. While a small 
number may not produce enough protective substance to survive, a 
sufficiently large number will. 

We may consider, for example, each system as a sphere of
radius $r_0$. These spheres may be arranged in a string of length
$2N_0 r_0$. If the coating is to be a cylinder of radius $r_0$, thickness
$\Delta \ll r_0$, and length $2N_0 r_0$, closed by “flat” surfaces at each end,
then the mass of the cylinder is

$$m_c = 2\pi r_0 \Delta \delta \times 2N_0 r_0 + 4\pi r_0^2 \Delta \delta = 4\pi r_0^2 \Delta \delta (N_0 + 1). \tag{25}$$

Putting

$$A = \frac{4\pi r_0^2 \Delta \delta k}{q}, \tag{26}$$

we have from (22) and (25)

$$N_0 > \frac{A}{1 - A}. \tag{27}$$

If $1 > A$, and $A \gg 1 - A$, $N_0$ will be very large. For $A > 1$, no
matter how large $N_0$, a sufficiently heavy coating will not be
produced.
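
The string arrangement can be explored the same way. This sketch assumes the balance $N_0 q/k \geq 4\pi r_0^2 \Delta \delta (N_0 + 1)$ and defines the dimensionless ratio $A = 4\pi r_0^2 \Delta \delta k/q$, from which the bound $A/(1 - A)$ follows when $A < 1$. The function names and the use of infinity to signal that no group size suffices are choices made for this illustration.

```python
import math


def coating_parameter(r0, thickness, density, q, k):
    """Dimensionless ratio A = 4*pi*r0**2*thickness*density*k/q."""
    return 4.0 * math.pi * r0 ** 2 * thickness * density * k / q


def min_systems_string(a_param):
    """Lower bound A/(1 - A) on n0; infinite when A >= 1."""
    if a_param >= 1.0:
        return math.inf
    return a_param / (1.0 - a_param)


if __name__ == "__main__":
    for a_param in (0.5, 0.9, 0.99, 1.2):
        print(a_param, min_systems_string(a_param))
```

Unlike the spherical case, the surface per system does not shrink as the string lengthens, so for $A \geq 1$ adding more systems never helps.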

III. A Relational Approach. In a previous paper (Rashevsky,
1959a) we discussed some minimum relational requirements which
a living system must satisfy. We considered there possible graphs
of the simplest living systems. We made, however, the
unnecessary assumption that the living unit is made of some particular
substance which carries or performs the different biological
functions. We shall now drop this unnecessary restriction and
consider that the living unit is a system of reactions, no single
component of which is the living unit.

By the same argument as in loc. cit. we consider as the simplest
possible case two types of molecules, $E_A$ and $E_B$, each of which
is made up of a number of components that are present in the
environment. Let $E_A$ be made up of the elements of the set
$\mathfrak{A} = \{A_1, A_2, \ldots, A_n\}$ of components, while $E_B$ is made up of the
elements of the set $\mathfrak{B} = \{B_1, B_2, \ldots, B_m\}$. In general

$$\mathfrak{A} \cap \mathfrak{B} \neq \emptyset. \tag{28}$$


We shall agree to denote by $\mathfrak{A}$ and $\mathfrak{B}$ only the sets of the
corresponding components when every one of those components is free
and not bound to any other chemical components. In other words
$A_i \in \mathfrak{A}$ and $B_i \in \mathfrak{B}$ if and only if $A_i$ and $B_i$ are in a free state.

Let, however, the reactions which lead to the formation of $E_A$
and $E_B$, respectively, and which we shall denote symbolically by
$\mathfrak{A} \to E_A$ and $\mathfrak{B} \to E_B$, never take place spontaneously. Let the
reaction $\mathfrak{A} \to E_A$ take place only in the presence of $E_B$ as
catalyst, while $\mathfrak{B} \to E_B$ takes place only in the presence of $E_A$ as
catalyst. In that case, even though $\mathfrak{A}$ and $\mathfrak{B}$ may be present,
neither $E_A$ nor $E_B$ can be formed unless both are already present.
The reaction system can then be written thus:

$$\mathfrak{A} + E_B \longrightarrow E_A + E_B; \tag{29}$$

$$\mathfrak{B} + E_A \longrightarrow E_B + E_A. \tag{30}$$


The graph of such a system is a directed two-cycle (Rashevsky, 
1959a) with two environmental inputs (Figure 1). The set of re- 
actions (29) and (30) represents a system which cannot be formed 
spontaneously. If, however, it already exists, it will, in general, 
increase in mass. More and more molecules $E_A$ and $E_B$ will be
formed. 
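
The behaviour of reactions (29) and (30) can be illustrated by a toy simulation. It assumes, beyond what the text states, simple mass-action kinetics with the free components in excess, so that each catalyst produces the other molecule at a rate proportional to its own amount. The function name, rate constant, and step size are arbitrary choices for this sketch.

```python
def simulate_two_cycle(e_a, e_b, rate=0.1, dt=0.1, steps=100):
    """Euler integration of dE_A/dt = rate*E_B, dE_B/dt = rate*E_A.

    E_B catalyzes the formation of E_A and E_A that of E_B; the free
    components are assumed to be in excess (an assumption of this sketch).
    """
    for _ in range(steps):
        new_a = rate * e_b * dt  # E_A formed with E_B as catalyst
        new_b = rate * e_a * dt  # E_B formed with E_A as catalyst
        e_a, e_b = e_a + new_a, e_b + new_b
    return e_a, e_b


if __name__ == "__main__":
    print(simulate_two_cycle(0.0, 0.0))  # empty system: nothing is formed
    print(simulate_two_cycle(1.0, 1.0))  # seeded system: both amounts grow
```

Starting from an empty system nothing is ever formed, whereas a seeded pair grows without limit as long as the free components last.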

Such a system resembles somewhat a living system, but it does 
not possess another important characteristic of the living, namely 
the ability of extracting the elements of the sets $\mathfrak{A}$ and $\mathfrak{B}$, which
are necessary for its production, from other aggregates in which
the $A_i$’s and $B_i$’s are bound.

FIGURE 1.


Let $\mathfrak{C}$ denote the set of naturally occurring compounds which
contain the $A_i$’s in a bound state. Similarly let $\mathfrak{D}$ denote the set
of naturally occurring compounds which contain the $B_i$’s in a
bound state. Then, if the system which we consider could be
called a primitive organism, it must be able to extract the sets
$\mathfrak{A}$ and $\mathfrak{B}$ from $\mathfrak{C}$ and $\mathfrak{D}$ respectively. This is the most general
formulation of the process of digestion. Such an extraction is
done by an appropriate enzyme. We could consider the case that
$E_A$ not only catalyzes the reaction $\mathfrak{B} \to E_B$ but also catalyzes
the reaction $\mathfrak{C} \to \mathfrak{A}$, or $\mathfrak{D} \to \mathfrak{B}$, or both. Or we may consider that
$E_A$ catalyzes $\mathfrak{C} \to \mathfrak{A}$, while $E_B$ catalyzes $\mathfrak{D} \to \mathfrak{B}$. Figure 2
represents two possible situations. The environmental outputs
are now $\mathfrak{C}$ and $\mathfrak{D}$. The system in Figure 2a consists of three
two-cycles, formed at the vertices. In Figure 2b there are one directed
two-cycle and two directed three-cycles.


FIGURE 2a.


FIGURE 2b.


Systems of that type have the following properties of living
organisms. The system, which we shall call $L$, does digest the
“food” $\mathfrak{C}$ and $\mathfrak{D}$. Of the products of digestion, the sets $\mathfrak{A}$ and $\mathfrak{B}$
are “assimilated” and used for building $E_A$ and $E_B$. Any
remaining components of $\mathfrak{C}$ and $\mathfrak{D}$ which do not belong to either $\mathfrak{A}$ or $\mathfrak{B}$
are “rejected.” The system $L$ also increases in size. There is,
however, no direct process of duplication if we consider the
system $L$ as composed of a large number of molecules. If, however,
we consider the system as being composed of only one molecule
of $E_A$ and one molecule of $E_B$, interacting with each other, then
every time a new pair $E_A$, $E_B$ is formed, the system is duplicated.
We cannot speak of a self-duplicating molecule, but we have a
self-duplicating system, consisting of one pair of different
interacting molecules.

The above two $L$-systems are perfectly legitimate abstract
conceptions. They are, however, not very plausible biologically. In
both of them the same enzyme which synthesizes a compound from
its components also acts as a “lytic” or “digestive” enzyme,
decomposing another compound. This is not a very likely
situation. We shall now consider a somewhat more complicated system
which obviates this difficulty.

Starting again with the system represented in Figure 1, consider
the case in which $E_A$, in addition to catalyzing the formation of
$E_B$ from $\mathfrak{B}$, also catalyzes the formation of a molecule $D_\mathfrak{C}$ from $\mathfrak{B}$.
In other words, the catalyst $E_A$ is not absolutely specific for the
formation of $E_B$ only. Similarly, let $E_B$ catalyze the formation
of a molecule $D_\mathfrak{D}$ from $\mathfrak{A}$. Let $D_\mathfrak{D}$ catalyze the decomposition of
$\mathfrak{D}$, in other words the reaction $\mathfrak{D} \to \mathfrak{B}$, while $D_\mathfrak{C}$ catalyzes $\mathfrak{C} \to \mathfrak{A}$.
The graph of such a system is shown in Figure 3a.

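We may consider an alternate case, shown in Figure 3b. Here
$E_B$ catalyzes the formation of $D_\mathfrak{C}$ from $\mathfrak{A}$, while $E_A$ catalyzes the
formation of $D_\mathfrak{D}$ from $\mathfrak{B}$.

A still more general case is that in which both $D_\mathfrak{C}$ and $D_\mathfrak{D}$
contain as components elements of both sets $\mathfrak{A}$ and $\mathfrak{B}$. In other words,
$D_\mathfrak{C}$ as well as $D_\mathfrak{D}$ are formed from $\mathfrak{A} \cup \mathfrak{B}$. The set of reactions in
this case is

$$\begin{aligned}
\mathfrak{A} + E_B &\longrightarrow E_A + E_B \\
\mathfrak{B} + E_A &\longrightarrow E_B + E_A \\
\mathfrak{A} + \mathfrak{B} + E_A &\longrightarrow D_\mathfrak{D} + E_A \\
\mathfrak{A} + \mathfrak{B} + E_B &\longrightarrow D_\mathfrak{C} + E_B \\
\mathfrak{D} + D_\mathfrak{D} &\longrightarrow \mathfrak{B} + D_\mathfrak{D} \\
\mathfrak{C} + D_\mathfrak{C} &\longrightarrow \mathfrak{A} + D_\mathfrak{C}
\end{aligned} \tag{31}$$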


In representing this set of reactions by a graph, we meet with
the difficulty inherent in the representation of this type of
phenomena by graphs (Figure 3c). In Figure 3a, it is clear that the
input of $\mathfrak{A}$ into $E_B$ results in two outputs, namely the
production of $E_A$ and the production of $D_D$. In Figure 3c, however,
there is an ambiguity. The production of $E_A$ and $D_D$ may need,
according to this graph, either both $\mathfrak{B}$ and $\mathfrak{A}$, or the
production of $D_D$ may need only $\mathfrak{A}$, while the production of
$E_A$ needs only $\mathfrak{B}$. With Rosen's assumption of
"non-contractibility" (1958a), only the first interpretation could
be assumed. Yet none of the three interpretations is the correct
one. Recourse probably will have to be made to Rosen's "abstract
block diagram" (1958b). But it should be generalized to consider
components with more than one output and contractible inputs.

FIGURE 3a.

FIGURE 3b.

FIGURE 3c.

Another difficulty indicates further the need of generalizing
Rosen's ingenious method. In the systems considered here, there
is no sharp distinction between outputs (or inputs) and components
as carriers of those outputs (or inputs). In Figure 3a we may
consider the molecule $E_A$ as a component, in Rosen's meaning of the
term. Then $\mathfrak{B}$ may be considered as the input into $E_A$, from the
environment as a component, and $E_B$ as the output. But $E_B$ may
itself be considered as a component. From the point of view of
mapping (Rosen, 1958b) we may say that $E_A$, acting as a component,
maps $\mathfrak{B}$ onto $E_B$. But it seems to be more legitimate to say
that the system as a whole maps $\mathfrak{B} \cup E_A$ onto $E_B$. The
systems considered here possess not only the property of being
organized, thus representing organisms, but they also are
individuals, meaning un-dividable. The individuality in the above
meaning of the word is also one of the basic properties of an
organism.

Another important property of the system considered above is a
certain vagueness of the distinction between the system and its
environment. The sets $\mathfrak{C}$ and $\mathfrak{D}$ in Figures 3 clearly
constitute the environment. But this is not so obvious about
$\mathfrak{A}$ and $\mathfrak{B}$. The unquestionably environmental inputs
correspond to edges that do not belong to any cycles. The sets
$\mathfrak{A}$ and $\mathfrak{B}$ are the results of the action of a part of the
system ($D_C$ and $D_D$) upon the environment, and they in turn act on
the system. In Figure 3a, for example, the edge $E_A D_C$ does belong
within the system, but the edges $D_C\mathfrak{A}$ and $\mathfrak{A}E_A$ of the
same three-cycle belong to the environment.

This unsharpness is characteristic also of actual organisms, 
whether on a low or high level of development. Blood is certainly 
part of the organism, yet it forms the milieu interieur for that very 
organism. With extracellularly digesting bacteria, the products of 
digestion are definitely outside the organism. Not so definite is 
the situation with intracellular digestion. 

The graphs shown in Figures 1-3 have one characteristic in
common. They are, topologically speaking, symmetric, in the
sense that the right and left sides of each are homeomorphic. (We
cannot, of course, speak in this case of the usual metric
symmetry.) Physically, however, the two halves are not identical.
Thus $E_A$ is different from $E_B$, $D_C$ from $D_D$, but the two halves
of each graph are complementary to each other.

Suppose the system, represented by any of these graphs, say
Figure 3a, is separated into two subsystems by removing the
interaction between $E_A$ and $E_B$. Then $E_A$, since it exists, will
rebuild $E_B$, while $E_B$ will rebuild $E_A$. If $\mathfrak{C}$ and
$\mathfrak{D}$ remain available to both substances, then, if separated,
each system will become reconstituted by rebuilding its complement.

It must be kept in mind that the graphs in Figures 1-3 do not 
reflect any actual spatial arrangement of the components. Thus 
when we spoke above about a separation of the system into two, 
we meant a functional rather than a geometric separation. A large 
number of L-systems may well constitute a homogeneous phase, 
without visible macroscopic structure. By a slight complication
it is, however, possible to give a geometric meaning to the word
"separation" in this connection. We may consider the case that
either $E_A$, or $E_B$, or $D_C$, or $D_D$, or any combination of them,
acts as catalyst for the production of a substance S, which serves as a
structural background to which the components are actually at- 
tached. It may be either a large molecule or a cluster of mole- 
cules to which the components are held by adsorption. Or S may 
form a protective envelope which holds the components spatially 
together. In such cases we can actually speak of the separation 
of the L-system into complementary parts, each part then reconsti- 
tuting the whole system. The reader will of course have noticed 
the analogy of this situation with the prevailing concept about the 
replication of the DNA molecule. We must, however, emphasize 
that the above considerations in no way offer, nor are they in- 
tended to offer, any ‘‘explanation’’ of the mode of replication of 
DNA. All the difficulties in regard to this mode of replication 
mentioned in section I (p. 238) remain; e.g., what would make 
the system break up? 

There is also no minimum size limit imposed on the L-system. 
Again, if some of the components are short-lived, considerations 
similar to those made in section II may lead to a minimal size of 
aggregates of such systems. A replication of such aggregates 
may then occur through macroscopic mechanisms such as diffusion 
drag forces, osmotic forces, elastic stresses, and the like. In 
such a replication of the aggregates, however, the individual sys- 
tems remain intact. We have a constant increase in the number of 
L-systems, which can take place only when at least one L-system 
is present. In this sense we have, as remarked above (p. 250), a 
replication on the molecular level, or a first order replication. To
that we must add the replication on the macroscopic level, or
second order replication. From the principle of biological
epimorphism (Rashevsky, 1960b), we expect replications of higher
order, which we indeed observe. Such replications are found in
higher organisms, which may be considered "societies of cells,"
and even in societies of higher organisms. Animal, as well as
human, societies do reproduce or replicate, and in many such
cases the appearance of a new society can take place only when
another society already exists.

IV. Relational Aspects of a Self-Replicating Molecule. As we 
have seen, the L-systems replicate as systems which cannot be 
meaningfully divided into independent components. They replicate 
as individuals because of the interaction between these com- 
ponents. This corresponds precisely to the view strongly ad- 
vocated by Oparin (1957). 

Our own studies (Rashevsky, 1959a,b), as well as those of 
Robert Rosen (1959), indicate that such a systemic approach is 
more plausible than the approach which endows a specific mole- 
cule with the property of self-replication. However, in an abstract 
study like this one, we should not prejudge any approach until its 
implications have been thoroughly studied in abstracto. Therefore 
we shall now investigate the relational aspects of the picture 
which corresponds to some current concepts about self-replicating 
molecules, in particular about DNA. 

As we pointed out elsewhere (Rashevsky, 1959a), a self-repli- 
cating simple molecule is represented topologically by a directed 
loop, that is a graph with only one vertex and one directed edge 
(Figure 4). Whatever the conceivable physical mechanism of such 
a replication may be, it is evident that it can take place only 
if all the components needed in the building of the replicating 
molecule are present. By the same reasoning as in section III (p. 
250), we see that a self-replicating molecule, which merely repli- 
cates in the presence of all necessary components and does not 
do anything else, could not be considered as endowed with the 
properties of the living. At best it would be observed as a
substance which increases in amount autocatalytically until the
supply of the component substances is exhausted.

FIGURE 4.

Hence such a molecule, which we shall designate as $I_m$, must, in
addition to self-replication, also catalyze the production of some
substance that, either directly or through a chain of intermediate
reactions, extracts the components of the $I_m$-molecule from "rough"
food products. Thus, denoting by $\mathfrak{A}_m$ the set of components of
which $I_m$ is built, by $\mathfrak{B}_m$ the set of components of which is
built the catalyst $D_C$ that catalyzes the reaction
$\mathfrak{C} \to \mathfrak{A}_m$, we may have


y 


D th oe 


ne i oe ee 


(32) 


The reaction is represented in Figure 5a. 
We may complicate the scheme ad libitum, as shown for example 
in Figure 5b. It is represented by the set of formulae 


b= Moat 2h. 4D 2 4 Dig 
Ds +0, (33) 
Dg + R— I, 


In any case the topological symmetry of the graph is destroyed.
Moreover, and this is the important point, although the molecule
$I_m$ replicates itself, it does it only as a part of an indivisible
system. Thus we come to the conclusion, already reached by
Robert Rosen (1959) by a very different argument, that the concept
of a single self-replicating molecule is self-contradictory.
Implicitly this is recognized in the accepted somewhat vague pictures
of the replication of DNA. The DNA molecule is considered as
catalyzing the production of a protein which, directly or through a
complex chain of reactions, leads to the availability of the purine
bases and other components needed for replication.

FIGURE 5a.

FIGURE 5b.

The above statement in no way precludes the usefulness of a 
theoretical consideration of such systems as shown in Figures 5. 
In their relational properties they are different from the systems 
represented in Figures 1-3. A mathematical abstract study of the 
principal similarities and differences between the two classes of 
systems is likely to lead to conclusions that may indicate which 
of the two classes is actually found in nature. The possibility of 
still different classes of systems is of course not precluded. 


This work has been aided by United States Public Health
Service Grant RG-5181.

A solution in quadratures is found for a special class of differential
equations suggested by studies in the growth of competing populations.


The simultaneous equations

$$\frac{dx}{dt} = a\,\bigl(b - x - f(y)\bigr)\,x, \qquad
\frac{dy}{dt} = c\,\bigl(d - y - g(x)\bigr)\,y \tag{1}$$

describe the competition between a population of animals feeding
on another population of animals or plants. This pair of equations
was first studied in this connection by Hutchinson (1947) and then
by Cunningham (1955). Special cases of (1) occur in all of the
mathematical literature on competition between species.

In general (1) cannot be integrated and one must be content to
find properties of $dy/dx$ in the $x,y$-plane. In this note we observe
that one can integrate the corresponding equation in the $x,y$-plane
when the equations are of the form

$$\frac{dx}{dt} = p\,(-x - my)\,x, \qquad
\frac{dy}{dt} = mp\,\bigl(s - y - P(x)\bigr)\,y \tag{2}$$

where $P(x)$ is any function for which $P(x)/x^{2}$ is integrable. In
particular $P(x)$ may be a polynomial, a case of practical interest in
populations.

From (2) we secure

$$\frac{dy}{dx} = \frac{m\,\bigl(s - y - P(x)\bigr)\,y}{-(x + my)\,x} \tag{3}$$

which is not homogeneous but which yields to the change of
variables $y = vx$ used to solve homogeneous equations. Upon making
this change of variables, (3) becomes the equation

$$\frac{1 + mv}{v}\,dv = \frac{mP(x) - ms - x}{x^{2}}\,dx$$

with variables separated. From this equation one easily secures

$$x \log y + my = ms + mx \int \frac{P(x)}{x^{2}}\,dx .$$
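As a numerical sanity check on the separated form, the following sketch integrates system (2) with a fixed-step fourth-order Runge-Kutta scheme and monitors the quantity $\log y + m(y - s)/x - m\int P(x)/x^{2}\,dx$, which is the relation above divided by $x$ and should therefore stay constant along a trajectory. The choice $P(x) = kx^{2}$, the parameter values, the initial point and all function names are illustrative assumptions, not taken from the note.

```python
import math

# Illustrative parameters (assumptions, not from the note).
P_RATE, M, S, K = 1.0, 0.5, 2.0, 0.1


def big_p(x):
    """Assumed polynomial P(x) = K*x**2, so the integral of P/x**2 is K*x."""
    return K * x * x


def rhs(x, y):
    """Right-hand sides of system (2)."""
    dx = P_RATE * (-x - M * y) * x
    dy = M * P_RATE * (S - y - big_p(x)) * y
    return dx, dy


def invariant(x, y):
    """log y + m*(y - s)/x - m*K*x, expected to be constant on a trajectory."""
    return math.log(y) + M * (y - S) / x - M * K * x


def rk4_step(x, y, h):
    """One classical Runge-Kutta step of size h."""
    k1x, k1y = rhs(x, y)
    k2x, k2y = rhs(x + 0.5 * h * k1x, y + 0.5 * h * k1y)
    k3x, k3y = rhs(x + 0.5 * h * k2x, y + 0.5 * h * k2y)
    k4x, k4y = rhs(x + h * k3x, y + h * k3y)
    x += h * (k1x + 2 * k2x + 2 * k3x + k4x) / 6.0
    y += h * (k1y + 2 * k2y + 2 * k3y + k4y) / 6.0
    return x, y


def drift(x0=1.0, y0=1.0, t_end=1.0, h=1e-3):
    """Absolute change of the invariant between t = 0 and t = t_end."""
    x, y = x0, y0
    c0 = invariant(x, y)
    for _ in range(int(round(t_end / h))):
        x, y = rk4_step(x, y, h)
    return abs(invariant(x, y) - c0)
```

Calling `drift()` returns the accumulated change of the monitored quantity; a value at the level of the integrator's truncation error supports the closed-form relation for this particular choice of $P(x)$.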
In Yeh, Martinek and de Beaumont (Bull. Math. Biophysics, 20,
203-16, 1958), a method is presented for determining successively better
central multipole representations of the current generators in a homo- 
geneous conducting sphere by measuring surface potentials at a suc- 
cessively increasing number of points. This paper generalizes the 
method such that the multipoles may be located at any chosen point in 
the conductor. The spherical harmonic expansion is advantageously 
used and the ‘‘interior sphere theorem’’ of Ludford, Martinek and Yeh 
(Proc. Cambridge Philos. Soc., 51, 389-93, 1955) makes possible dis- 
turbance potential expressions in closed forms. A method for approxi- 
mate determination of the eccentricity is also presented. In the theory 
of electrocardiography, the eccentric multipoles can more accurately 
represent the heart as a current generator with fewer surface potential 
measurements than the central multipoles. 


Eccentric Multipole Expansion of Any Potential in an Infinite
Domain. Consider two systems of coordinates: one with the origin
at $x = 0$, $y = 0$ and $z = 0$ and the other with the origin at
$x = x' = 0$, $y = y' = 0$ and $z = z' + b = b$. The relations between
the rectangular coordinates $(x, y, z)$ and the spherical coordinates
$(r, \theta, \varphi)$ are

$$x = r \sin\theta \cos\varphi, \qquad
y = r \sin\theta \sin\varphi, \qquad
z = r \cos\theta \tag{1}$$

and those between $(x', y', z')$ and $(r', \theta', \varphi')$ are

$$\begin{aligned}
x' &= r' \sin\theta' \cos\varphi' = r' \sin\theta' \cos\varphi \\
y' &= r' \sin\theta' \sin\varphi' = r' \sin\theta' \sin\varphi \\
z' &= r' \cos\theta'
\end{aligned} \tag{2}$$

since $\varphi' = \varphi$.

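Let $\rho(r_0, \theta_0, \varphi_0)$ be a current source density distribution
which is zero outside the domain $r' < a_0$. The potential at a point
outside of this domain is given by the following series (see, for
example, Morse and Feshbach, 1953, p. 1276):

$${}_0\Phi'(r', \theta', \varphi) = \sum_{n=0}^{\infty} \sum_{m=0}^{n}
\left[ A_{mn}\,{}_0\phi^{e}_{mn}(r', \theta', \varphi)
     + B_{mn}\,{}_0\phi^{o}_{mn}(r', \theta', \varphi) \right] \tag{3}$$

where

$$\begin{aligned}
{}_0\phi^{e}_{mn}(r', \theta', \varphi) &= (r')^{-n-1}\, Y^{e}_{mn}(\theta', \varphi), \\
{}_0\phi^{o}_{mn}(r', \theta', \varphi) &= (r')^{-n-1}\, Y^{o}_{mn}(\theta', \varphi)
\end{aligned} \tag{4}$$

are the unit potentials for the even and odd mnth multipoles
respectively and

$$\begin{aligned}
A_{mn} &= \epsilon_m \frac{(n-m)!}{(n+m)!}
  \int_0^{2\pi} \cos(m\varphi_0)\,d\varphi_0
  \int_0^{\pi} P_n^m(\cos\theta_0)\,\sin\theta_0\,d\theta_0
  \int_0^{a_0} \rho(r_0, \theta_0, \varphi_0)\,(r_0)^{n+2}\,dr_0, \\
B_{mn} &= \epsilon_m \frac{(n-m)!}{(n+m)!}
  \int_0^{2\pi} \sin(m\varphi_0)\,d\varphi_0
  \int_0^{\pi} P_n^m(\cos\theta_0)\,\sin\theta_0\,d\theta_0
  \int_0^{a_0} \rho(r_0, \theta_0, \varphi_0)\,(r_0)^{n+2}\,dr_0
\end{aligned} \tag{5}$$

are the strengths of the various multipoles ($B_{0n} = 0$).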
In equations (4),

$$\begin{aligned}
Y^{e}_{mn}(\theta', \varphi) &= \cos(m\varphi)\,P_n^m(\cos\theta'), \\
Y^{o}_{mn}(\theta', \varphi) &= \sin(m\varphi)\,P_n^m(\cos\theta')
\end{aligned} \tag{6}$$

are the even and odd spherical harmonics respectively;
$Y^{o}_{0n}(\theta', \varphi)$ does not exist. In equations (5) and (6)
$P_n^m(\cos\theta)$ is the associated Legendre function of the first kind
and $\epsilon_m$ is the Neumann factor,

$$\epsilon_m = 1 \ \text{for}\ m = 0, \qquad
\epsilon_m = 2 \ \text{for}\ m \neq 0. \tag{7}$$

In equation (3) the term corresponding to $n = 0$ is the potential
due to a source at $(0, 0, b)$ with strength $A_{00}$. The first of
equations (5) shows that $A_{00}$ is exactly the total current in
$r' < a_0$. For a volume conductor with insulated body surface there
is no surface current across the boundary; hence $A_{00} = 0$. This is
the case in the theory of electrocardiography.

There are three terms for $n = 1$ in equation (3). They are the
potentials due to three component dipoles at $(0, 0, b)$ with strengths
$A_{01}$, $A_{11}$ and $B_{11}$ respectively.

The five terms with $n = 2$ in equation (3) are the potentials due
to five component quadrupoles at $(0, 0, b)$ with strengths $A_{02}$,
$A_{12}$, $A_{22}$, $B_{12}$ and $B_{22}$ respectively.

The seven terms with $n = 3$ in equation (3) are the potentials
due to seven component octopoles at $(0, 0, b)$ with strengths $A_{03}$,
$A_{13}$, $A_{23}$, $A_{33}$, $B_{13}$, $B_{23}$ and $B_{33}$ respectively.

For larger values of $n$ there are $2^4$-poles, ..., $2^n$-poles at
$(0, 0, b)$. For a discussion on multipoles and graphical arrangements
of dipoles and quadrupoles see, for example, Morse and Feshbach
(1953, p. 1281).

If $r'$ is appreciably larger than $a_0$, the potentials for the higher
values of $n$ become so small as to be impossible of measurement,
and all we can obtain are the values of $A_{mn}$ and $B_{mn}$ for $n$ less
than some finite value that becomes smaller with large $r'$, which
is the distance from the multipoles at which we measure the
potential. Thus the potential of equation (3) is specified whenever
a sufficient number of eccentric multipole strengths $A_{mn}$ and $B_{mn}$
are determined.


Expressing ${}_0\Phi'(r', \theta', \varphi)$ in Terms of $(r, \theta, \varphi)$.
It is advantageous to express the potential ${}_0\Phi'(r', \theta', \varphi)$
of equation (3) in terms of the spherical coordinates $(r, \theta, \varphi)$
about the center $(0, 0, 0)$. For instance, one of the harmonic
solutions about the point $(0, 0, b)$ may be expressed in terms of
harmonic solutions about the point $(0, 0, 0)$ as follows (see, for
example, Morse and Feshbach, 1953, p. 1272):

$$\begin{aligned}
{}_0\phi^{e}_{mn}(r', \theta', \varphi) &= b^{-n-1} \sum_{s=n}^{\infty}
  \frac{(s-m)!}{(n-m)!\,(s-n)!} \left(\frac{b}{r}\right)^{s+1} Y^{e}_{ms}(\theta, \varphi), \\
{}_0\phi^{o}_{mn}(r', \theta', \varphi) &= b^{-n-1} \sum_{s=n}^{\infty}
  \frac{(s-m)!}{(n-m)!\,(s-n)!} \left(\frac{b}{r}\right)^{s+1} Y^{o}_{ms}(\theta, \varphi).
\end{aligned} \tag{8}$$


Substituting equations (8) into equation (3) we have
${}_0\Phi(r, \theta, \varphi)$ which represents the same potential as
${}_0\Phi'(r', \theta', \varphi)$ with $A_{00} = 0$:

$${}_0\Phi(r, \theta, \varphi) = \sum_{n=1}^{\infty} {}_0\Phi_n(r, \theta, \varphi) \tag{9}$$

where

$${}_0\Phi_n(r, \theta, \varphi) = b^{-n-1} \sum_{s=n}^{\infty} \sum_{m=0}^{n}
  \frac{(s-m)!}{(n-m)!\,(s-n)!} \left(\frac{b}{r}\right)^{s+1}
  \left[ A_{mn} Y^{e}_{ms}(\theta, \varphi) + B_{mn} Y^{o}_{ms}(\theta, \varphi) \right]. \tag{10}$$

In practice, it is sufficient to evaluate a finite number of terms
in equations (9) and (10), because of fast convergence; hence the
upper limits $\infty$ for the $n$ and $s$ summations may be replaced by
certain finite integers $N$ and $S$ respectively:

$${}_0\Phi(r, \theta, \varphi) = \sum_{n=1}^{N} {}_0\Phi_n(r, \theta, \varphi) \tag{11}$$

where

$${}_0\Phi_n(r, \theta, \varphi) = b^{-n-1} \sum_{s=n}^{S} \sum_{m=0}^{n}
  \frac{(s-m)!}{(n-m)!\,(s-n)!} \left(\frac{b}{r}\right)^{s+1}
  \left[ A_{mn} Y^{e}_{ms}(\theta, \varphi) + B_{mn} Y^{o}_{ms}(\theta, \varphi) \right]. \tag{12}$$

Disturbance Potential Due to the Presence of a Sphere. According
to the "interior sphere theorem" of Ludford, Martinek, and Yeh
(1955), the disturbance potential
${}_1\Phi(\mathbf{r}) = {}_1\Phi(r, \theta, \varphi)$ due to the presence of a
sphere $r = a > a_0 + b$ can be expressed in closed form in terms of
the original potential ${}_0\Phi(\mathbf{r}) = {}_0\Phi(r, \theta, \varphi)$ in
which $A_{00} = 0$ for the case considered. For the boundary condition

$$\frac{\partial}{\partial r}\left({}_0\Phi + {}_1\Phi\right) = 0
\quad \text{on } r = a, \tag{13}$$
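this disturbance potential reads

$${}_1\Phi(\mathbf{r}) = \frac{a}{r}\,{}_0\Phi\!\left(\frac{a^{2}}{r^{2}}\,\mathbf{r}\right)
  + \frac{1}{a} \int_{a^{2}/r}^{\infty} {}_0\Phi\!\left(\lambda\,\frac{\mathbf{r}}{r}\right) d\lambda. \tag{14}$$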


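In order to evaluate the disturbance potential
${}_1\Phi(\mathbf{r}) = {}_1\Phi(r, \theta, \varphi)$ by means of equation (14),
we first consider the disturbance potential for the component
harmonic solution

$${}_0\phi^{e}_{ms}(\mathbf{r}) = {}_0\phi^{e}_{ms}(r, \theta, \varphi)
  = r^{-s-1}\, Y^{e}_{ms}(\theta, \varphi). \tag{15}$$

Substituting equation (15) into equation (14) we have

$${}_1\phi^{e}_{ms}(\mathbf{r}) = {}_1\phi^{e}_{ms}(r, \theta, \varphi)
  = a^{-2s-1}\, s^{-1} (s+1)\, r^{s}\, Y^{e}_{ms}(\theta, \varphi). \tag{16}$$

Similarly for

$${}_0\phi^{o}_{ms}(\mathbf{r}) = {}_0\phi^{o}_{ms}(r, \theta, \varphi)
  = r^{-s-1}\, Y^{o}_{ms}(\theta, \varphi), \tag{17}$$

we have

$${}_1\phi^{o}_{ms}(\mathbf{r}) = {}_1\phi^{o}_{ms}(r, \theta, \varphi)
  = a^{-2s-1}\, s^{-1} (s+1)\, r^{s}\, Y^{o}_{ms}(\theta, \varphi). \tag{18}$$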
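As a cross-check that does not rely on equation (14), the coefficient in equations (16) and (18) can be recovered directly from the boundary condition (13). The sketch below assumes that the disturbance term for a single harmonic must be regular at $r = 0$ and is therefore proportional to $r^{s} Y_{ms}$; the constant $c$ is introduced here only for illustration.

```latex
% Assumed trial form: source term r^{-s-1} Y_{ms} plus a regular term c r^{s} Y_{ms}.
\frac{\partial}{\partial r}\Bigl[\, r^{-s-1} + c\, r^{s} \Bigr]_{r=a}
  = -(s+1)\, a^{-s-2} + c\, s\, a^{s-1} = 0
\quad\Longrightarrow\quad
c = \frac{s+1}{s}\, a^{-2s-1}.
```

This value of $c$ agrees with the factor $a^{-2s-1} s^{-1} (s+1)$ appearing in (16) and (18).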


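Substituting equations (11) and (12) into equation (14) and using
equations (16) and (18), we obtain the disturbance potential

$${}_1\Phi(r, \theta, \varphi) = \sum_{n=1}^{N} {}_1\Phi_n(r, \theta, \varphi) \tag{19}$$

where

$${}_1\Phi_n(r, \theta, \varphi) = b^{-n-1} \sum_{s=n}^{S} \sum_{m=0}^{n}
  \frac{(s-m)!}{(n-m)!\,(s-n)!}\, \frac{s+1}{s}
  \left(\frac{b}{a}\right)^{s+1} \left(\frac{r}{a}\right)^{s}
  \left[ A_{mn} Y^{e}_{ms}(\theta, \varphi) + B_{mn} Y^{o}_{ms}(\theta, \varphi) \right]. \tag{20}$$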


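Determination of the Current Generator by Surface Measurements.
Combining equations (11) and (19) we have the total potential at
any point $(r, \theta, \varphi)$ in the spherical volume conductor

$$\Phi(r, \theta, \varphi) = {}_0\Phi(r, \theta, \varphi) + {}_1\Phi(r, \theta, \varphi)
  = \sum_{n=1}^{N} \Phi_n(r, \theta, \varphi) \tag{21}$$

where

$$\begin{aligned}
\Phi_n(r, \theta, \varphi) &= {}_0\Phi_n(r, \theta, \varphi) + {}_1\Phi_n(r, \theta, \varphi) \\
 &= b^{-n-1} \sum_{s=n}^{S} \sum_{m=0}^{n} \frac{(s-m)!}{(n-m)!\,(s-n)!}
   \left[ \left(\frac{b}{r}\right)^{s+1}
        + \frac{s+1}{s} \left(\frac{b}{a}\right)^{s+1} \left(\frac{r}{a}\right)^{s} \right]
   \left[ A_{mn} Y^{e}_{ms}(\theta, \varphi) + B_{mn} Y^{o}_{ms}(\theta, \varphi) \right].
\end{aligned} \tag{22}$$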


On the spherical surface $r = a$ the total potential at the point
$(a, \theta, \varphi)$ is, from equations (21) and (22),

$$\Phi(a, \theta, \varphi) = \sum_{n=1}^{N} b^{-n-1} \sum_{s=n}^{S} \sum_{m=0}^{n}
  \frac{(s-m)!}{(n-m)!\,(s-n)!}\, \frac{2s+1}{s} \left(\frac{b}{a}\right)^{s+1}
  \left[ A_{mn} Y^{e}_{ms}(\theta, \varphi) + B_{mn} Y^{o}_{ms}(\theta, \varphi) \right], \tag{23}$$

where $Y^{e}_{ms}(\theta, \varphi)$ and $Y^{o}_{ms}(\theta, \varphi)$ are functions of
the coordinates of this point and $A_{mn}$ and $B_{mn}$ are constants to
be determined. For $N = 1$, there are three unknowns $A_{01}$, $A_{11}$ and
$B_{11}$. For $N = 2$, there are eight unknowns $A_{01}$, $A_{02}$, $A_{11}$,
$A_{12}$, $A_{22}$, $B_{11}$, $B_{12}$ and $B_{22}$. For $N = 3$, there are
fifteen unknowns $A_{01}$, $A_{02}$, $A_{03}$, $A_{11}$, $A_{12}$, $A_{13}$,
$A_{22}$, $A_{23}$, $A_{33}$, $B_{11}$, $B_{12}$, $B_{13}$, $B_{22}$, $B_{23}$ and
$B_{33}$. And so on for higher values of $N$. The increase of the value
$S$ increases the number of terms in the $\Phi(a, \theta, \varphi)$ expression
(and hence the accuracy of the expression) but does not affect the
number of unknowns.

Suppose the potential difference between the surface points
$P_1(a, \theta_1, \varphi_1)$ and $P_2(a, \theta_2, \varphi_2)$ is measured. Then
equation (23) gives one linear algebraic equation for the unknowns
$A_{mn}$ and $B_{mn}$:

$$\begin{aligned}
\Phi(a, \theta_2, \varphi_2) - \Phi(a, \theta_1, \varphi_1)
 = \sum_{n=1}^{N} b^{-n-1} \sum_{s=n}^{S} \sum_{m=0}^{n}
   &\frac{(s-m)!}{(n-m)!\,(s-n)!}\, \frac{2s+1}{s} \left(\frac{b}{a}\right)^{s+1} \\
   \times \Bigl\{ &A_{mn} \bigl[ Y^{e}_{ms}(\theta_2, \varphi_2) - Y^{e}_{ms}(\theta_1, \varphi_1) \bigr]
   + B_{mn} \bigl[ Y^{o}_{ms}(\theta_2, \varphi_2) - Y^{o}_{ms}(\theta_1, \varphi_1) \bigr] \Bigr\}.
\end{aligned} \tag{24}$$

For $N = 1$ three such equations are necessary to determine the
three unknowns uniquely. Hence we need the measurements of
potential differences between the point $P_1$ and a third point
$P_3(a, \theta_3, \varphi_3)$ and between $P_1$ and a fourth point
$P_4(a, \theta_4, \varphi_4)$. Thus potential measurements at four surface
points determine uniquely the component strengths $A_{01}$, $A_{11}$ and
$B_{11}$ of an eccentric dipole at $(0, 0, b)$ as the sole current
generator.

Similarly, to determine uniquely the component strengths of an
eccentric dipole and an eccentric quadrupole at $(0, 0, b)$ as the
current generators, potential measurements at nine surface points
are necessary. To determine uniquely the component strengths of
an eccentric dipole, an eccentric quadrupole and an eccentric
octopole at $(0, 0, b)$ as the current generators, potential
measurements at sixteen surface points are necessary. And so on for
higher eccentric multipoles.
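The counts quoted above (three, eight and fifteen unknowns; four, nine and sixteen surface points) follow from enumerating the strengths $A_{mn}$ and $B_{mn}$ for $n = 1, \dots, N$, with $A_{00} = 0$ and no $B_{0n}$. A minimal sketch of that bookkeeping is given below; the function names and the string labels are illustrative assumptions.

```python
def strength_labels(n_max):
    """List the unknown strengths A_mn, B_mn for n = 1..n_max.

    A_00 is excluded (it vanishes for an insulated surface) and
    B_0n is skipped because the odd harmonic with m = 0 does not exist.
    """
    labels = []
    for n in range(1, n_max + 1):
        for m in range(0, n + 1):
            labels.append(f"A{m}{n}")
            if m > 0:
                labels.append(f"B{m}{n}")
    return labels


def points_needed(n_max):
    """Each potential difference gives one equation, so one extra
    reference point is needed beyond the number of unknowns."""
    return len(strength_labels(n_max)) + 1
```

For example, `strength_labels(1)` lists the three dipole strengths, and `points_needed(3)` gives the sixteen surface points quoted for the dipole, quadrupole and octopole together.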

It can be noted here that in Yeh, Martinek and de Beaumont 
(1958) where the more graphic Taylor series expansion (rather than 
the spherical harmonic expression) is used for the central multi- 
pole representation, the corresponding numbers of points on which 
to take potential measurements necessary for unique determina- 
tions are four, ten, and twenty for N=1, 2, and 3 respectively. 
This is because the Taylor series expansion contains redundant 
terms which represent degrees of freedom of the charge density 
that do not give rise to any potential outside the density domain 
and which are omitted from the spherical harmonic series. The
non-redundant and more compact spherical harmonic expansion,
though less graphic, is, in fact, exactly equal to the Taylor series
expansion (see, for example, Morse and Feshbach, 1953, p. 1280).


Spherical Harmonics as Functions of Rectangular Coordinates.
It is useful to record the expressions of spherical harmonics in
terms of the rectangular coordinates. They are as follows (see,
for example, Morse and Feshbach, 1953):

$$\begin{aligned}
Y^{e}_{00} &= 1 \\
Y^{e}_{01} &= z/r \\
Y^{e}_{11} &= x/r \\
Y^{o}_{11} &= y/r \\
Y^{e}_{02} &= (2z^{2} - x^{2} - y^{2})/2r^{2} \\
Y^{e}_{12} &= 3xz/r^{2} \\
Y^{o}_{12} &= 3yz/r^{2} \\
Y^{e}_{22} &= 3(x^{2} - y^{2})/r^{2} \\
Y^{o}_{22} &= 6xy/r^{2} \\
Y^{e}_{03} &= z(2z^{2} - 3x^{2} - 3y^{2})/2r^{3} \\
Y^{e}_{13} &= 3x(4z^{2} - x^{2} - y^{2})/2r^{3} \\
Y^{o}_{13} &= 3y(4z^{2} - x^{2} - y^{2})/2r^{3} \\
Y^{e}_{23} &= 15z(x^{2} - y^{2})/r^{3} \\
Y^{o}_{23} &= 30xyz/r^{3} \\
Y^{e}_{33} &= 15x(x^{2} - 3y^{2})/r^{3} \\
Y^{o}_{33} &= 15y(3x^{2} - y^{2})/r^{3}
\end{aligned} \tag{25}$$

where

$$r^{2} = x^{2} + y^{2} + z^{2}. \tag{26}$$

These expressions substituted in the series expansion of equation
(23) enable us to evaluate relatively easily the surface potential
for any given point $(x, y, z)$.
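The rectangular expressions of equations (25) can be checked against the angular definitions (6) at an arbitrary point. The sketch below tabulates both forms for $n \le 3$ and reports the largest discrepancy. It assumes the associated Legendre functions are taken without the $(-1)^m$ phase factor, which is the convention that makes the two forms agree; the function names and the dictionary layout are illustrative.

```python
import math


def rect_harmonics(x, y, z):
    """Harmonics of equations (25), keyed by (kind, m, n) with kind 'e' or 'o'."""
    r = math.sqrt(x * x + y * y + z * z)
    return {
        ("e", 0, 1): z / r,
        ("e", 1, 1): x / r,
        ("o", 1, 1): y / r,
        ("e", 0, 2): (2 * z * z - x * x - y * y) / (2 * r**2),
        ("e", 1, 2): 3 * x * z / r**2,
        ("o", 1, 2): 3 * y * z / r**2,
        ("e", 2, 2): 3 * (x * x - y * y) / r**2,
        ("o", 2, 2): 6 * x * y / r**2,
        ("e", 0, 3): z * (2 * z * z - 3 * x * x - 3 * y * y) / (2 * r**3),
        ("e", 1, 3): 3 * x * (4 * z * z - x * x - y * y) / (2 * r**3),
        ("o", 1, 3): 3 * y * (4 * z * z - x * x - y * y) / (2 * r**3),
        ("e", 2, 3): 15 * z * (x * x - y * y) / r**3,
        ("o", 2, 3): 30 * x * y * z / r**3,
        ("e", 3, 3): 15 * x * (x * x - 3 * y * y) / r**3,
        ("o", 3, 3): 15 * y * (3 * x * x - y * y) / r**3,
    }


def legendre(m, n, c):
    """P_n^m(c) for n <= 3, written out explicitly (no (-1)^m phase assumed)."""
    s = math.sqrt(1.0 - c * c)
    table = {
        (0, 1): c, (1, 1): s,
        (0, 2): (3 * c * c - 1) / 2, (1, 2): 3 * s * c, (2, 2): 3 * s * s,
        (0, 3): (5 * c**3 - 3 * c) / 2, (1, 3): 1.5 * s * (5 * c * c - 1),
        (2, 3): 15 * s * s * c, (3, 3): 15 * s**3,
    }
    return table[(m, n)]


def angular_harmonic(kind, m, n, theta, phi):
    """Equations (6): cos(m*phi) or sin(m*phi) times P_n^m(cos theta)."""
    trig = math.cos(m * phi) if kind == "e" else math.sin(m * phi)
    return trig * legendre(m, n, math.cos(theta))


def max_mismatch(r, theta, phi):
    """Largest absolute difference between the two forms at one point."""
    x = r * math.sin(theta) * math.cos(phi)
    y = r * math.sin(theta) * math.sin(phi)
    z = r * math.cos(theta)
    return max(
        abs(value - angular_harmonic(kind, m, n, theta, phi))
        for (kind, m, n), value in rect_harmonics(x, y, z).items()
    )
```

Evaluating `max_mismatch` at a few points with $0 < \theta < \pi$ is a quick way to catch a transcription slip in any one entry of the table.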

In what follows we shall carry out the working expressions for
various degrees of approximations of the proposed eccentric
multipole representations.

The Case of N = S = 1. For $N = S = 1$ equation (23) becomes

$$\Phi(a, \theta, \varphi) = 3\,b^{-2} \left(\frac{b}{a}\right)^{2} \sum_{m=0}^{1}
  \left[ A_{m1} Y^{e}_{m1}(\theta, \varphi) + B_{m1} Y^{o}_{m1}(\theta, \varphi) \right]
  = 3\,a^{-2} \left[ A_{01} Y^{e}_{01}(\theta, \varphi) + A_{11} Y^{e}_{11}(\theta, \varphi)
  + B_{11} Y^{o}_{11}(\theta, \varphi) \right]. \tag{27}$$

This amounts to the representation of the current generator in a
spherical volume conductor by a central dipole. For instance,
both Wilson's equilateral tetrahedron (Wilson, Johnston and
Kossmann, 1947, p. 602) and Grant's isosceles tetrahedron (see, for
example, Sodi-Pallares and Calder, 1956, p. 128) are special
cases of this approximation. (See Yeh, Martinek and de Beaumont,
1958, pp. 213-215.)


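The Case of N = 1 and S = 2. For $N = 1$ and $S = 2$ equation (23)
becomes

$$\begin{aligned}
\Phi(a, \theta, \varphi) &= b^{-2} \sum_{s=1}^{2} \sum_{m=0}^{1}
  \frac{(s-m)!}{(1-m)!\,(s-1)!}\, \frac{2s+1}{s} \left(\frac{b}{a}\right)^{s+1}
  \left[ A_{m1} Y^{e}_{ms} + B_{m1} Y^{o}_{ms} \right] \\
 &= a^{-2} \left[ 3 \left( A_{01} Y^{e}_{01} + A_{11} Y^{e}_{11} + B_{11} Y^{o}_{11} \right)
   + \frac{5b}{2a} \left( 2A_{01} Y^{e}_{02} + A_{11} Y^{e}_{12} + B_{11} Y^{o}_{12} \right) \right] \\
 &= a^{-2} \left[ A_{01} \left( 3Y^{e}_{01} + \frac{5b}{a} Y^{e}_{02} \right)
   + A_{11} \left( 3Y^{e}_{11} + \frac{5b}{2a} Y^{e}_{12} \right)
   + B_{11} \left( 3Y^{o}_{11} + \frac{5b}{2a} Y^{o}_{12} \right) \right].
\end{aligned} \tag{28}$$

This amounts to an approximate representation of the current
generator in a spherical volume conductor by an eccentric dipole
at $(0, 0, b)$. Notice that for $b = 0$ equation (28) reduces to
equation (27).

The Case of N = 1 and S = 3. For $N = 1$ and $S = 3$ equation (23)
becomes

$$\begin{aligned}
\Phi(a, \theta, \varphi) &= b^{-2} \sum_{s=1}^{3} \sum_{m=0}^{1}
  \frac{(s-m)!}{(1-m)!\,(s-1)!}\, \frac{2s+1}{s} \left(\frac{b}{a}\right)^{s+1}
  \left[ A_{m1} Y^{e}_{ms} + B_{m1} Y^{o}_{ms} \right] \\
 &= a^{-2} \Bigl[ 3 \left( A_{01} Y^{e}_{01} + A_{11} Y^{e}_{11} + B_{11} Y^{o}_{11} \right)
   + \frac{5}{2} \frac{b}{a} \left( 2A_{01} Y^{e}_{02} + A_{11} Y^{e}_{12} + B_{11} Y^{o}_{12} \right) \\
 &\qquad\quad + \frac{7}{3} \left(\frac{b}{a}\right)^{2}
   \left( 3A_{01} Y^{e}_{03} + A_{11} Y^{e}_{13} + B_{11} Y^{o}_{13} \right) \Bigr] \\
 &= a^{-2} \Bigl[ A_{01} \Bigl\{ 3Y^{e}_{01} + 5 \frac{b}{a} Y^{e}_{02}
   + 7 \left(\frac{b}{a}\right)^{2} Y^{e}_{03} \Bigr\}
   + A_{11} \Bigl\{ 3Y^{e}_{11} + \frac{5}{2} \frac{b}{a} Y^{e}_{12}
   + \frac{7}{3} \left(\frac{b}{a}\right)^{2} Y^{e}_{13} \Bigr\} \\
 &\qquad\quad + B_{11} \Bigl\{ 3Y^{o}_{11} + \frac{5}{2} \frac{b}{a} Y^{o}_{12}
   + \frac{7}{3} \left(\frac{b}{a}\right)^{2} Y^{o}_{13} \Bigr\} \Bigr].
\end{aligned} \tag{29}$$

This is a more refined representation of the current generator in a
spherical volume conductor by an eccentric dipole at $(0, 0, b)$.
Again, for $b = 0$, equation (29) reduces to equation (27).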
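The special cases above are all instances of the triple sum in equation (23), so a direct evaluation of that sum gives a convenient check on the expanded forms. The sketch below is one way to do this under stated assumptions: harmonics are limited to degree three and tabulated explicitly (without the $(-1)^m$ phase), the strengths are passed as a dictionary keyed by a hypothetical `("A" or "B", m, n)` tuple, and the factor $b^{-n-1}(b/a)^{s+1}$ is rewritten as $b^{s-n}/a^{s+1}$ so that $b = 0$ can be used.

```python
import math


def legendre(m, n, c):
    """P_n^m(c) for n <= 3, written out explicitly (no (-1)^m phase assumed)."""
    s = math.sqrt(1.0 - c * c)
    table = {
        (0, 1): c, (1, 1): s,
        (0, 2): (3 * c * c - 1) / 2, (1, 2): 3 * s * c, (2, 2): 3 * s * s,
        (0, 3): (5 * c**3 - 3 * c) / 2, (1, 3): 1.5 * s * (5 * c * c - 1),
        (2, 3): 15 * s * s * c, (3, 3): 15 * s**3,
    }
    return table[(m, n)]


def surface_potential(strengths, b, a, theta, phi, s_max):
    """Evaluate the sum of equation (23) on the sphere r = a.

    strengths maps ('A' or 'B', m, n) to a number for n = 1..N;
    missing entries are treated as zero. s_max plays the role of S (<= 3 here).
    """
    n_max = max(n for (_, _, n) in strengths)
    total = 0.0
    for n in range(1, n_max + 1):
        for s in range(n, s_max + 1):
            for m in range(0, n + 1):
                coef = math.factorial(s - m) / (
                    math.factorial(n - m) * math.factorial(s - n)
                )
                # b**(-n-1) * (b/a)**(s+1), rearranged so that b = 0 is allowed
                radial = (2 * s + 1) / s * b ** (s - n) / a ** (s + 1)
                amplitude = (
                    strengths.get(("A", m, n), 0.0) * math.cos(m * phi)
                    + strengths.get(("B", m, n), 0.0) * math.sin(m * phi)
                )
                total += coef * radial * legendre(m, s, math.cos(theta)) * amplitude
    return total
```

With only dipole strengths supplied and `s_max=2`, the result can be compared with the grouped form of equation (28); with `b = 0.0`, raising `s_max` leaves the value unchanged, in line with the remark that the eccentric expressions reduce to equation (27).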


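The Case of N = S = 2. For $N = S = 2$ equation (23) becomes

$$\begin{aligned}
\Phi(a, \theta, \varphi) &= b^{-2} \sum_{s=1}^{2} \sum_{m=0}^{1}
  \frac{(s-m)!}{(1-m)!\,(s-1)!}\, \frac{2s+1}{s} \left(\frac{b}{a}\right)^{s+1}
  \left[ A_{m1} Y^{e}_{ms} + B_{m1} Y^{o}_{ms} \right]
  + \frac{5}{2}\, b^{-3} \left(\frac{b}{a}\right)^{3} \sum_{m=0}^{2}
  \left[ A_{m2} Y^{e}_{m2} + B_{m2} Y^{o}_{m2} \right] \\
 &= a^{-2} \left[ A_{01} \left( 3Y^{e}_{01} + \frac{5b}{a} Y^{e}_{02} \right)
   + A_{11} \left( 3Y^{e}_{11} + \frac{5b}{2a} Y^{e}_{12} \right)
   + B_{11} \left( 3Y^{o}_{11} + \frac{5b}{2a} Y^{o}_{12} \right) \right] \\
 &\quad + \frac{5}{2}\, a^{-3} \left( A_{02} Y^{e}_{02} + A_{12} Y^{e}_{12} + B_{12} Y^{o}_{12}
   + A_{22} Y^{e}_{22} + B_{22} Y^{o}_{22} \right).
\end{aligned} \tag{30}$$

This is a representation of the current generator in a spherical
volume conductor by the combination of an eccentric dipole and an
eccentric quadrupole at $(0, 0, b)$. For $b = 0$, equation (30)
becomes the representation of the combination of a central dipole
and a central quadrupole.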


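The Case of N = S = 3. For $N = S = 3$ equation (23) becomes

$$\begin{aligned}
\Phi(a, \theta, \varphi) &= \sum_{n=1}^{3} b^{-n-1} \sum_{s=n}^{3} \sum_{m=0}^{n}
  \frac{(s-m)!}{(n-m)!\,(s-n)!}\, \frac{2s+1}{s} \left(\frac{b}{a}\right)^{s+1}
  \left[ A_{mn} Y^{e}_{ms} + B_{mn} Y^{o}_{ms} \right] \\
 &= a^{-2} \Bigl[ A_{01} \Bigl\{ 3Y^{e}_{01} + 5 \frac{b}{a} Y^{e}_{02}
   + 7 \left(\frac{b}{a}\right)^{2} Y^{e}_{03} \Bigr\}
   + A_{11} \Bigl\{ 3Y^{e}_{11} + \frac{5}{2} \frac{b}{a} Y^{e}_{12}
   + \frac{7}{3} \left(\frac{b}{a}\right)^{2} Y^{e}_{13} \Bigr\} \\
 &\qquad\quad + B_{11} \Bigl\{ 3Y^{o}_{11} + \frac{5}{2} \frac{b}{a} Y^{o}_{12}
   + \frac{7}{3} \left(\frac{b}{a}\right)^{2} Y^{o}_{13} \Bigr\} \Bigr] \\
 &\quad + a^{-3} \Bigl[ A_{02} \Bigl\{ \frac{5}{2} Y^{e}_{02} + 7 \frac{b}{a} Y^{e}_{03} \Bigr\}
   + A_{12} \Bigl\{ \frac{5}{2} Y^{e}_{12} + \frac{14}{3} \frac{b}{a} Y^{e}_{13} \Bigr\}
   + B_{12} \Bigl\{ \frac{5}{2} Y^{o}_{12} + \frac{14}{3} \frac{b}{a} Y^{o}_{13} \Bigr\} \\
 &\qquad\quad + A_{22} \Bigl\{ \frac{5}{2} Y^{e}_{22} + \frac{7}{3} \frac{b}{a} Y^{e}_{23} \Bigr\}
   + B_{22} \Bigl\{ \frac{5}{2} Y^{o}_{22} + \frac{7}{3} \frac{b}{a} Y^{o}_{23} \Bigr\} \Bigr] \\
 &\quad + \frac{7}{3}\, a^{-4} \left( A_{03} Y^{e}_{03} + A_{13} Y^{e}_{13} + B_{13} Y^{o}_{13}
   + A_{23} Y^{e}_{23} + B_{23} Y^{o}_{23} + A_{33} Y^{e}_{33} + B_{33} Y^{o}_{33} \right).
\end{aligned} \tag{31}$$

This is a representation of the current generator in a spherical
volume conductor by the combination of an eccentric dipole, an
eccentric quadrupole and an eccentric octopole at $(0, 0, b)$. For
$b = 0$ equation (31) becomes the representation of the combination
of a central dipole, a central quadrupole and a central octopole.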


Approximate Determination of Eccentricity Location. Suppose
we wish to represent the current generator in a volume conductor
by a combination of eccentric multipoles lying on the line
$x = y = 0$ (i.e. $\theta = 0$). The location of the eccentricity $z = b$
can be approximately determined by a sufficient number of surface
potential measurements.

Consider the surface potential expression for the case of
$N = S = 3$ in equation (31).

$$\Phi(a, \theta, \varphi) = a^{-2} \left[ 3A_{01} Y^{e}_{01}
  + \frac{5}{a} \left( b A_{01} + \tfrac{1}{2} A_{02} \right) Y^{e}_{02} + \dots \right] \tag{32}$$

where $A_{01}$ is the strength of one of the eccentric dipole
components and $A_{02}$ is the strength of one of the eccentric
quadrupole components, both located at $(0, 0, b)$.

Let us represent the same surface potential by the combination
of central multipoles of strengths $\bar{A}_{01}$, $\bar{A}_{02}$, ... etc.
Then equation (31) with $b = 0$ becomes

$$\Phi(a, \theta, \varphi) = a^{-2} \left[ 3\bar{A}_{01} Y^{e}_{01}
  + \frac{5}{2a} \bar{A}_{02} Y^{e}_{02} + \dots \right]. \tag{33}$$

Comparing the coefficients of the $Y^{e}_{01}$ and $Y^{e}_{02}$ terms in
equations (32) and (33) we must have

$$\bar{A}_{01} = A_{01}, \qquad \bar{A}_{02} = 2bA_{01} + A_{02} \tag{34}$$

or

$$b = \frac{\bar{A}_{02} - A_{02}}{2A_{01}}. \tag{35}$$

Hence if $A_{02} = 0$, $b = \bar{A}_{02}/2\bar{A}_{01}$ can be immediately
obtained whenever the central multipole representation is
determined from sufficient surface potential measurements (see Yeh
and Martinek, 1959, equation (57)). If $A_{02} \neq 0$, the value of
$b' = \bar{A}_{02}/2\bar{A}_{01}$ can be considered as the first
approximation of an iteration process. When the eccentric multipole
representation about the point $(0, 0, b')$ is determined from
surface potential measurements, the resulting value $A'_{02}$ can be
substituted for $A_{02}$ in equation (35) to obtain the second
approximation of $b$. And so on, until the difference between the
successive approximations of $b$ becomes negligible.

Such a method of determination for $b$ can also be extended to
comparing other components of the dipole and the quadrupole.
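The relations (34) and (35) between the central and the eccentric strengths are simple enough to express as two small helper functions. The following is a minimal sketch with hypothetical function and argument names; it covers only the algebra of (34) and (35), not the fitting of strengths to measured potentials.

```python
def central_from_eccentric(b, a01, a02):
    """Equation (34): central strengths implied by eccentric strengths at z = b."""
    central_a01 = a01
    central_a02 = 2.0 * b * a01 + a02
    return central_a01, central_a02


def eccentricity(central_a02, a01, a02=0.0):
    """Equation (35). With a02 = 0 this is the first approximation b'."""
    return (central_a02 - a02) / (2.0 * a01)
```

For instance, strengths `a01 = 2.0` and `a02 = 0.6` at `b = 0.25` correspond to a central quadrupole component of `1.6`; `eccentricity(1.6, 2.0)` then gives the first approximation, and supplying the eccentric `a02` recovers `b` itself.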


Conclusions. A method is presented for determining successively
better eccentric multipole representations of the current generators
in a homogeneous conducting sphere by measuring surface
potentials at successively increasing numbers of points. For a
chosen framework, i.e., for chosen points $P_1$, $P_2$, ..., and for a
chosen eccentricity $b$, most of the numerical computation can be
carried out once for all before the measurements used for the
diagnosis.

For bodies of non-spherical shapes the formulations hold
approximately in the sense that the disturbance potentials are
approximately evaluated according to a spherical boundary. Thus if
equation (21) is used instead of equation (23) the measuring points
$P_1$, $P_2$, ..., can be taken on the actual body surface and need not
necessarily be on a spherical surface. Finally, a method for
approximate determination of the eccentricity is also presented.

The method presented in the paper is not only a generalization 
of the central multipole representation of Yeh, Martinek and de 
Beaumont (1958) but also an alternative formulation for multipole 
representation in general. The spherical harmonic expansion used 
here renders a more compact (and non-redundant) formulation than 
the Taylor series expansion. Moreover, a proper choice of the 
eccentricity always reduces the number of higher multipoles neces- 
sary for inclusion. Hence a more accurate representation is pos- 
sible with fewer surface potential measurements. However, the 
Taylor series expansion is more graphic. And it might often be 
desirable to establish the definite correlations between these two 
systems of representations. This will be a topic of further re- 
search and publication. 

The flexibility of the eccentric multipole representation and the
possibility of the representation of a heart about its own center
offer vast opportunities for numerical speculations and
experimental verifications.

Schemes are presented for induced synthesis of the dissociable
enzyme in which repeated use of the template is made. The role of the
inducer is to release the repression. A mathematical analysis is carried
out and expressions are obtained to describe the kinetics of enzyme
formation. A practical case (penicillinase synthesis) is compared with
theoretically derived equations by using an analogue computer to
simulate an enzyme forming system. A good correlation between
theoretical and experimental data is obtained.


In a previous publication (Heinmets and Herschman, 1960), a 
general model for induced enzyme synthesis was proposed and 
mathematical analysis was carried out for nondissociable enzyme 
synthesis. The latter was characterized by the feature that the 
template system was used up while the enzyme was synthesized 
and synthesis stopped when templates were no longer available. 
In a subsequent publication (Heinmets and Herschman, 1961), this 
model was extended to the ‘‘chain synthesis’’ of two enzymes in 
sequence. In this paper we will analyze schemes in which a re- 
peated use is made of the template for the enzyme synthesis. 
That such a mechanism may exist has been shown in the induction 
of penicillinase by penicillin in Bacillus cereus (Pollock, 1959). 
It was pointed out in previous publications (Heinmets and Hersch- 
man, 1960, 1961) that a concept of repression had been proposed 
by Pardee, Jacob and Monod (1958) for the induced enzyme synthe- 
sis. However, since experimental data supporting this view was 
essentially genetic and limited in scope, it was not considered 
feasible to evaluate the kinetics of enzyme formation in a re- 
pressed system, but it was outlined that, on the basis of certain 
premises, the mathematical treatment (single step enzyme synthe- 
sis) for the classical and repressed systems could be considered 
to be similar. 

The role of a repressor in controlling the synthesis of an en- 
zyme has been further elucidated by a recent publication of Gorini 
(1960). In this work a detailed and careful analysis is carried out 
on the formation of ornithine transcarbamylase of Escherichia coli, 
and the antagonism between the substrate and repressor is defi- 
nitely established. This relation permits us to visualize how the 
basic mechanisms involved can be effective in controlling bio- 
synthetic processes in complex metabolic pathways (Pardee, 1959). 

In this paper we will propose two schemes for the single-step 
enzyme synthesis which may offer insight and perhaps an explana- 
tion for the processes involved. Also, a comparative analysis be- 
tween some experimentally observable data and postulated mecha- 
nisms will be carried out. A unified hypothesis for induced and 
repressed enzyme synthesis was proposed by Vogel (1957) in 
which the action of the inducer and repressor is to affect the rate 
of dissociation of a template product (enzyme protein) from its 
template. The role of the repressor is to bind newly formed en- 
zyme protein to its site of synthesis, while induction represents 
the neutralization of a binding effect. The principal weakness of 
this scheme is the assumption that all enzymes have to be ‘‘soluble’’ 
enzymes, since the formed enzyme has to be dissociated from the 
template. While there are many enzymes which are ‘‘soluble’’ and 
can be isolated from intra- or extracellular fluid medium, there are 
also enzymes which are firmly attached to or are part of the intra- 
cellular structural elements. Furthermore, experimental evidence 
indicates that in some cases a continuous synthesis of RNA is 
essential for induced enzyme formation to occur, while in other 
cases nucleic acid metabolism related to the induced enzyme 
formation is essential only in the initial stages of induction (Par- 
dee, 1954; Spiegelman, Halvorson, and Ben-Ishai, 1955; Creaser, 
1956; Pollock, 1959). In view of such evidence, it can be sug- 
gested that the process of dissociation of the newly formed en- 
zyme from the template is not the principal function of the inducer 
and repressor. In the following we will propose schemes which 
would indicate a different role for the inducer and repressor. 


Schematic Formulation* 


Basic schemes for induced enzyme synthesis have to be formu- 
lated on the basis of available experimental data. Otherwise they 
are entirely arbitrary and do not serve any useful purpose. On the 
other hand, experimental data in the literature is extensive in 
number but heterogeneous in character so that no coherent picture 
may be obtained by direct interpretation. Consequently, schemes 
postulated on the basis of such experimental information have to 
possess some speculative characteristics. With this in mind, we 
will proceed to set up some schemes for induced enzyme synthesis. 

Since the phenomenon of repression seems, at least in principle, 
fairly well established (Pardee, Jacob, and Monod, 1958; Gorini, 
1960), we will proceed on the basic premise that the principal role 
of the inducer is to release the repression, and the genetically 
inherent template pattern provides configuration specificity for the 
formed enzyme. The problem obviously may be more complex when 
a system of enzymes is considered, such as in the case of sequen- 
tial induction, but at the present time we will limit ourselves to a 
single-step enzyme synthesis. While the data in the literature 
concerning the induced enzyme synthesis is voluminous for cases 
in which induction is carried out in the presence of the inducer, 
the induction experiments for cases in which the inducer is re- 
moved from the medium are limited. For the latter, the principal 
source is a series of publications by Pollock on penicillinase in- 
duction of B. cereus, and the subject has been reviewed by him 
(Pollock, 1959). 

In most cases, when the inducer is removed from the cellular 
environment, the rate of enzyme synthesis is reduced or it stops 
immediately (Cohn, 1957; Pollock, 1959). In this paper, we are 
interested in the case in which enzyme induction can proceed after 
the removal of the inducer, which seems to imply that a repeated 


*Since detailed information on induced enzyme synthesis is not cur- 
rently available, it is understood that this schematic formulation repre- 
sents only an overall process. As such it probes relations between 
general operational units and describes a gross behavior of the system. 


280 F. HEINMETS AND A. HERSCHMAN 


use of the template is made in the process of enzyme synthesis. 
Deadaptation studies indeed indicate that cells may continue to pro- 
duce an enzyme for several generations after removal of the in- 
ducer, and it is established that the original free inducer is not 
present during the synthesis (Pollock, 1950, 1952; Rickenberg, 
Yanofsky, and Bonner, 1953; Neidhardt and Magasanik, 1956). We 
exclude the cases in which enzyme synthesis may continue in the 
absence of an external inducer but where there is evidence of 
intracellular accumulation of inducer (Monod, 1956; Cohn, 1957; 
Pardee, 1957). 

In general, enzyme synthesis may start ‘‘practically immediately’’ 
after the addition of the inducer or there may be a considerable 
lag before a measurable quantity of enzyme is formed (Monod, 1956; 
Cohn, 1957; Pollock, 1959). Since it is considered that the role 
of the inducer is to release the repression, then it appears that in 
the case of immediate synthesis the templates of the enzyme form- 
ing system are immediately operational after interaction with the 
inducer. However, synthesis with prolonged lag phase may imply 
that the template system is not operational and becomes so only 
after definite metabolic activity. Such is the case of penicillinase 
formation (Pollock, 1950, 1952; Pollock and Perret, 1951). 

In order to analyze enzyme synthesis with ‘‘small lag’’ and 
‘‘extended lag,’’ two basic schemes are presented. Scheme I 
deals with synthesis in which the repressed template becomes 
operational after interaction with the inducer. Since we are study- 
ing the case of single step synthesis, it is assumed that the in- 
ducer is not a substrate. The case in which the inducer is a sub- 
strate for the newly formed enzyme in a chain synthesis and in 
which the end product acts as a repressor in the initial step of 
enzyme formation is currently being analyzed, but it is a subject 
for future publication. Scheme II deals with synthesis with lag 
period, and it will be compared with experimental data published 
in the literature on penicillinase production. The latter is the only 
case in which sufficient systematic and consistent experimental 
data is available for an induced enzyme synthesis. While β- 
galactosidase is an enzyme on which most extensive research has 
been performed, it is evident that there is a permeability problem 
for the inducer (Cohn, 1957; Pollock, 1959; Heinmets and Hersch- 
man, 1961) and consequently data published on β-galactosidase 
formation cannot at the present time be used for theoretical analy- 
sis of a single step enzyme synthesis. 
Synthesis of the Dissociable Enzyme 
I. Synthesis with Short Lag 


When the enzyme dissociates itself from the template, after the 
synthesis, the latter can be visualized as being able to participate 
in the enzyme formation once more. Such a repeated use of tem- 
plate for synthesis can be analyzed on the basis of postulated 
mechanisms, presented in Scheme I. 


Scheme I 


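\[ T + S \underset{k_{-1}}{\overset{k_1}{\rightleftharpoons}} [TS]; \qquad [TS] + P \xrightarrow{k_2} [TS - P] \xrightarrow{k_3} [TS] + E \]

(the [TS] released in the last step re-enters the second step) 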
Notations: 

1) Letters indicating individual components of the scheme are 
also used to indicate the concentrations; complexes of components 
are enclosed in brackets. 

2) Concentrations of complexes, which have been formed by 
interaction of components, are indicated by the letters in 
parentheses. 

3) Multiplication product of two components is indicated by two 
letters, which represent the components. 


T - repressed template 
S - inducer (not a substrate) 
P - pool for the synthesis 
E - dissociable enzyme 
[TS] - active template (B) 
[TS - P] - complex enzyme (C) 


k_1, k_{-1} - rate constants for formation and dissociation of inducer 
and template complex 

k_2 - represents the rate of a multiple step complex formation; 
it is the average rate of the process when the last com- 
ponent of the pool has interacted with the template. 

k_3 - dissociation constant for enzyme separation 


In Scheme I the reaction in the first step can be considered to be 
reversible. Consider the following reactions: 


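\[ T + S \underset{k_{-1}}{\overset{k_1}{\rightleftharpoons}} B, \]

\[ B + P \xrightarrow{k_2} C, \]

\[ C \xrightarrow{k_3} B + E, \]

which imply the following mass-action differential equations: 

\[
\begin{aligned}
dT/dt &= -k_1 S T + k_{-1} B, \\
dB/dt &= k_1 S T - k_{-1} B - k_2 B P + k_3 C, \\
dC/dt &= k_2 B P - k_3 C.
\end{aligned} \tag{1}
\]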


Since the kinetics of enzyme synthesis will depend on the con- 
centrations of the components of the system, various combinations 
will be analyzed. 
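
The mass-action system can also be explored numerically. The following is a minimal sketch, not the authors' procedure: it writes the rates implied by the three reactions of Scheme I (with dE/dt = k_3 C taken from the step C -> B + E, and dP/dt = -k_2 B P when the pool is allowed to deplete) and integrates them with a fixed-step Runge-Kutta method. The function names, the `constant_pool` switch, and the default rate constants are illustrative assumptions.

```python
def scheme1_rhs(y, k1, km1, k2, k3, S, constant_pool=True):
    """Rates for Scheme I; y = [T, B, C, P, E], inducer S held fixed."""
    T, B, C, P, E = y
    bind = k1 * S * T - km1 * B      # T + S <-> B
    load = k2 * B * P                # B + P -> C
    release = k3 * C                 # C -> B + E
    dP = 0.0 if constant_pool else -load
    return [-bind, bind - load + release, load - release, dP, release]


def rk4(f, y, t_end, n_steps):
    """Classical fixed-step Runge-Kutta integration from t = 0 to t_end."""
    h = t_end / n_steps
    for _ in range(n_steps):
        a = f(y)
        b = f([u + 0.5 * h * v for u, v in zip(y, a)])
        c = f([u + 0.5 * h * v for u, v in zip(y, b)])
        d = f([u + h * v for u, v in zip(y, c)])
        y = [u + h * (p + 2 * q + 2 * r + s) / 6
             for u, p, q, r, s in zip(y, a, b, c, d)]
    return y


def simulate_scheme1(t_end, T0=1.0, P0=1.0, k1=50.0, km1=0.0, k2=1.0,
                     k3=1.0, S=1.0, constant_pool=True, n_steps=5000):
    """Integrate Scheme I from T = T0, P = P0, B = C = E = 0 (assumed values)."""
    f = lambda y: scheme1_rhs(y, k1, km1, k2, k3, S, constant_pool)
    return rk4(f, [T0, 0.0, 0.0, P0, 0.0], t_end, n_steps)
```

Calling `simulate_scheme1(5.0)` returns the five concentrations at t = 5; the template total T + B + C stays at T0 throughout, which is a convenient check on any implementation.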

A. Constant Pool. Such a condition could exist in a metabolic 
system, where a feed-back mechanism is operative and a terminal 
substrate concentration is kept at constant level. There is experi- 
mental evidence that such a feed-back mechanism may be operative 
in metabolic reactions and synthetic processes (Yates and Pardee, 
1956; Umbarger and Brown, 1958). Hypothetically, this type of 
synthesis could also occur in conditions where an abundant pool 
is available, but the template is limited; i.e., this could occur in 
a case in which the synthesis of an enzyme is induced and the 
amino acid pool is continuously replenished, but nucleic acid 
formation is limited. 

We assume that inducer concentration is constant during the en- 
zyme synthesis. We have consequently 


\[ S = \text{constant}, \qquad T + B + C = T_0, \tag{2} \]


where T_0 is the initial value of T, since B and C are assumed to 
be zero initially. In the case of constant pool concentration, P = 
P_0 always and the set of differential equations considered be- 
comes linear and has the following immediate solutions in terms of 
the convenient parameters, 


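\[
\begin{aligned}
K &= k_3 + k_2 P_0, && \text{total rate constant of intermediate stage;} \\
k &= k_2 k_3 P_0 / K, && \text{relative rate constant of intermediate stage;} \\
k_{-1}/k_1 S &= 0;
\end{aligned} \tag{3}
\]

\[ B = (k_3 T_0/K)\left[ 1 - \frac{K(k_1 S - k_3)}{k_3 (k_1 S - K)} e^{-k_1 S t} + \frac{k_2 P_0 \, k_1 S}{k_3 (k_1 S - K)} e^{-Kt} \right]; \]

\[ C = (k_2 P_0 T_0/K)\left[ 1 + \frac{K}{k_1 S - K} e^{-k_1 S t} - \frac{k_1 S}{k_1 S - K} e^{-Kt} \right]; \]

\[ E = kT_0 \left( t - \frac{1}{k_1 S} - \frac{1}{K} \right) + \frac{kT_0}{k_1 S - K} \left[ \frac{k_1 S}{K} e^{-Kt} - \frac{K}{k_1 S} e^{-k_1 S t} \right]. \]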


It has been pointed out previously (Heinmets and Herschman, 1960) 
that in the process of enzyme synthesis we can assume that k_1 >> 
k_2 and k_1 >> k_3. Since inducer and pool concentration levels are 
comparable, consequently k_1 S >> K and enzyme synthesis at times 
much less than 1/k_1 S is not observable. The initial short time 
solutions may be obtained in the range 


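\[ \frac{1}{K} \gg t \gg \frac{1}{k_1 S}, \]

which are 

\[
\begin{aligned}
T &= T_0 e^{-k_1 S t}, \\
B &= T_0 \left( 1 - e^{-k_1 S t} - k_2 P_0 t \right), \\
C &= k_2 P_0 T_0 t, \\
E &= \tfrac{1}{2} k_2 k_3 P_0 T_0 t^2.
\end{aligned} \tag{4}
\]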
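The long time solutions, where t > 1/K, are 

\[
\begin{aligned}
T &= 0, \\
B &= \frac{k_3 T_0}{K} \left( 1 + \frac{k_2 P_0}{k_3} e^{-Kt} \right), \\
C &= \frac{k_2 P_0 T_0}{K} \left( 1 - e^{-Kt} \right), \\
E &= kT_0 \left( t - \frac{1}{K} + \frac{e^{-Kt}}{K} \right).
\end{aligned} \tag{5}
\]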

Since the pool is constant, there is an unlimited production of the 
enzyme. At large time values the enzyme is formed essentially at 
a linear rate. 
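
As a cross-check on the constant-pool results, the sketch below evaluates the closed-form expressions for B, C and E (with k_{-1} neglected) together with the long-time form of E. It is an illustration under assumed parameter values; the argument names `k1S` (the product k_1 S) and `k2P0` (the product k_2 P_0) are conveniences introduced here, not notation from the paper.

```python
import math


def constant_pool_solution(t, k1S, k2P0, k3, T0):
    """Closed-form B, C, E for constant pool and constant inducer, k_{-1} = 0."""
    K = k3 + k2P0                  # total rate constant of intermediate stage
    k = k2P0 * k3 / K              # relative rate constant of intermediate stage
    a = k1S
    ea, eK = math.exp(-a * t), math.exp(-K * t)
    B = (k3 * T0 / K) * (1 - K * (a - k3) / (k3 * (a - K)) * ea
                         + k2P0 * a / (k3 * (a - K)) * eK)
    C = (k2P0 * T0 / K) * (1 + K / (a - K) * ea - a / (a - K) * eK)
    E = (k * T0 * (t - 1 / a - 1 / K)
         + k * T0 / (a - K) * ((a / K) * eK - (K / a) * ea))
    return B, C, E


def long_time_E(t, k2P0, k3, T0):
    """Long-time enzyme curve: the limit of E when k_1 S is very large."""
    K = k3 + k2P0
    k = k2P0 * k3 / K
    return k * T0 * (t - 1 / K + math.exp(-K * t) / K)
```

All three quantities vanish at t = 0, the slope of E equals k_3 C at every time, and for large k_1 S the full expression for E approaches `long_time_E`, whose slope tends to k T_0.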

A graphical presentation can be made (discarding an extremely 
short time range) by using short and long time solutions. Enzyme 
concentration at various time values can be determined and a 
graph constructed. Two curves can be drawn through the points, 
which have been obtained from short and long time solutions. A 
curve for the entire time range is obtained, when short and long 
time curves are connected with a smooth line. In Figure 1 en- 
zyme formation is presented as a function of time. It is evident 
that curves obtained from short and long time solutions merge 
near Kt = 0.5 and the general curve for enzyme for- 
mation can be drawn quite accurately. As a matter of fact, for 
practical purposes the long time solution alone seems sufficient 
to represent enzyme formation as a function of time. 
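
The two curves of Figure 1 can be tabulated in dimensionless form. In this sketch x stands for Kt and the returned values are E in units of kT_0/K; the function names are illustrative. Expanding the long-time expression for small x gives x^2/2 minus higher-order terms, which is one way to see why the long-time solution alone already reproduces the short-time behaviour.

```python
import math


def short_time_curve(x):
    """E K / (k T0) from the short-time solution, x = K t."""
    return 0.5 * x * x


def long_time_curve(x):
    """E K / (k T0) from the long-time solution, x = K t."""
    return x - 1.0 + math.exp(-x)


def tabulate(xs=(0.1, 0.25, 0.5, 1.0, 2.0, 5.0)):
    """Rows of (x, short-time value, long-time value) for plotting."""
    return [(x, short_time_curve(x), long_time_curve(x)) for x in xs]
```

For large x the long-time curve approaches the straight line x - 1, i.e. a linear rate of enzyme formation after a delay of 1/K.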


[Figure 1: graph of ENZYME (ordinate) against TIME (abscissa, 0 to 5).] 


FIGURE 1. Enzyme formation with unlimited pool. Abscissa: Kt, time 
limit. Ordinate: E; values in graph should be multiplied by the factor 
kT_0/K to obtain absolute concentrations. 

Equations used for graph: short time, E = (1/2)(kT_0/K)(Kt)^2; 
long time, E = (kT_0/K)(Kt - 1 + e^{-Kt}). 


B. Vanishing Pool. The pool is considered to be finite and is 
depleted during the enzyme synthesis. Such a condition could 
exist in a certain growth phase of the cell where there is no build- 
up of pool components during the enzyme synthesis, or in an ex- 
perimental system where pool level is controlled and limited. In 
such case 


\[ dP/dt = -k_2 B P, \qquad dE/dt = k_3 C. \tag{6} \]
Thus: P + E + C = P_0, where P_0 is the initial amount of pool. 

We shall assume again k_{-1}/k_1 S = 0. In the case of a finite pool, 
significant changes in the rate of enzyme formation occur only for 
times which are much larger than 1/k_1 S, i.e., when the template is 
already in the complex form [TS]. At times shorter than 1/k_1 S, 
neither intermediates nor enzyme formation can be measured by 
conventional methods, and consequently we will analyze enzyme 
synthesis at times which are longer than 1/k_1 S. Thus, 


\[ T = 0, \qquad B + C = T_0, \qquad P + E + C = P_0, \qquad dB/dt = -dC/dt. \tag{7} \]


The equations are now non-linear and solutions are difficult, if 
not impossible, to obtain analytically. However, one can find an 
immediate first integral which makes the overall surveillance of 
the system simpler. From the rate equations for the enzyme and 
pool, one has: 


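\[ C = \frac{1}{k_3} \frac{dE}{dt} \quad \text{and} \quad B = -\frac{1}{k_2 P} \frac{dP}{dt}. \tag{8} \]

Substituting these into B + C = T_0 one obtains, upon integration, 

\[ P = P_0 \exp (k_2 E/k_3 - k_2 T_0 t). \tag{9} \]

Now, initially (times just after the decay of T can be considered 
final) 

\[ C = E = 0 \quad \text{and} \quad B = T_0, \; P = P_0, \]

but after sufficiently long time, we have 

\[ E = P_0, \quad B = T_0, \quad \text{and} \quad C = P = 0. \]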


Thus, if we define P_1 = P_0 exp (k_2 P_0/k_3), we see that P starts 
from P_0 as an exponential decay with a mean life of 1/k_2 T_0 and is 
asymptotic to another exponential decay which started at P_1 but 
with the same mean life. 
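
The first integral can be checked numerically. The sketch below integrates the finite-pool system from the state B = T_0, C = E = 0, P = P_0 and compares P with P_0 exp(k_2 E/k_3 - k_2 T_0 t). The integrator, the function names, and the unit parameter values are assumptions chosen for illustration.

```python
import math


def finite_pool_rhs(y, k2, k3, T0):
    """Rates for the finite-pool case; y = [P, C, E] and B = T0 - C."""
    P, C, E = y
    B = T0 - C
    return [-k2 * P * B, k2 * P * B - k3 * C, k3 * C]


def integrate_finite_pool(t_end, P0=1.0, T0=1.0, k2=1.0, k3=1.0, n_steps=4000):
    """Fixed-step Runge-Kutta integration from P = P0, C = E = 0."""
    f = lambda y: finite_pool_rhs(y, k2, k3, T0)
    y, h = [P0, 0.0, 0.0], t_end / n_steps
    for _ in range(n_steps):
        a = f(y)
        b = f([u + 0.5 * h * v for u, v in zip(y, a)])
        c = f([u + 0.5 * h * v for u, v in zip(y, b)])
        d = f([u + h * v for u, v in zip(y, c)])
        y = [u + h * (p + 2 * q + 2 * r + s) / 6
             for u, p, q, r, s in zip(y, a, b, c, d)]
    return y


def first_integral_residual(t, P, E, P0=1.0, T0=1.0, k2=1.0, k3=1.0):
    """Difference between P and the value predicted by the first integral."""
    return P - P0 * math.exp(k2 * E / k3 - k2 * T0 * t)
```

The residual should vanish to within integration error at every time, and P + C + E should remain equal to P_0.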

Somewhere in the mid-region, when P lies between the two 
curves, B has a minimum and C a maximum, viz., k_2 P B - k_3 C = 0. 
In this region, d^2E/dt^2 = 0, and the curve of E as a function of 
time is straight; E appears linear. 

The actual differential equation for the enzyme is transcendental 
and not obviously soluble. It is: 

\[ dE/dt + k_3 E = k_3 P_0 \left( 1 - \exp (k_2 E/k_3 - k_2 T_0 t) \right). \tag{10} \]


An approximate solution may be found, however, in any region of 
interest, by replacing the E in the argument of the exponential by 
some suitably obtained constant. Calling k_2 E/k_3 in the argument 
A, one has: 

\[ E = P_0 (1 - e^{-k_3 t}) - P_2 (e^{-k_2 T_0 t} - e^{-k_3 t}), \tag{11} \]

where we have put P_2 = k_3 P_0 e^A/(k_3 - k_2 T_0). From this value of 
E one may obtain approximately expressions for the other concen- 
trations as, 

\[ P = P_0 \exp (A - k_2 T_0 t), \qquad B = T_0 - C \quad \text{where} \quad C = P_0 - P - E. \tag{12} \]


If A is small, one may use these results to obtain the approximate 
time of the linearity of E by setting its second derivative equal to 
zero, whence, if E_0 is assumed to be its value and t_0 the time, we 
obtain: 

\[ t_0 = \log (k_3/k_2 T_0)/(k_3 - k_2 T_0) + E_0/k_3 T_0. \tag{13} \]

For a large range of approximately equal intermediate rates, the 
major term in t_0 is about 2/(k_3 + k_2 T_0). 
For small values of the time, one has for E, 


E = kgksPo Tot?/2 — kak 2To (To + 2Po)t*/64+.... (4) 


This may be compared with the case for the constant pool which 
was 


ih = kakgPo To t7/2 — kak2Po (Po + Ton ss + 


and one sees that the initial behavior of the systems is not very 
different. For large values of the time one may estimate the be- 
havior by putting k_2 P_0/k_3 for the constant A in the above analysis. 

One convenient method of plotting the graph for E in the inter- 
mediate zone makes use of the fact that C never becomes very 
large so that dC/dt can be considered negligible over quite a large 
range of values (this is exactly true over the linear range of E). 
One has from the appropriate equations: 
\[ P = \tfrac{1}{2} \sqrt{(T_0 + E - P_0 + k_3/k_2)^2 + 4 k_3 (P_0 - E)/k_2} - \tfrac{1}{2} (T_0 + E - P_0 + k_3/k_2). \tag{15} \]

From this result one may tabulate P as a function of E for various 
values of the parameters and then substitute the joint values of P 
and E into: 

\[ P = P_0 \exp (k_2 E/k_3 - k_2 T_0 t) \tag{16} \]

and solve for t as a function of E, the inversion of which is the 
desired result. This method is exact for the linear region of E and 
also is quite accurate for larger values of t, since dC/dt is asymp- 
totically zero. Therefore this method and the power series for 
small values allow one to obtain a numerical solution with little 
difficulty (Figure 2). The system can also be programmed for the 


[Figure 2: graph of ENZYME (ordinate) against time.] 


FIGURE 2. Enzyme formation with finite pool as a function of time, 
z = k_3 t, time limit. E, enzyme concentration (P_0 = T_0 = k_3/k_2). 
●, solution for small z values. ○, solution for large z values. 
×, solution for intermediate z values. 
analogue computer, as will be seen in the next section where we 
consider a somewhat different case. 
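
The plotting method for the intermediate zone (take dC/dt as negligible, obtain P from E, then solve the first integral for t) can be sketched as follows. This is an illustration of the procedure as described, with hypothetical function names; the default values P_0 = T_0 = k_3/k_2 = 1 follow the choice stated for Figure 2.

```python
import math


def pool_from_enzyme(E, P0=1.0, T0=1.0, k2=1.0, k3=1.0):
    """P for a given E when dC/dt is neglected (positive root of the quadratic)."""
    b = T0 + E - P0 + k3 / k2
    return 0.5 * math.sqrt(b * b + 4.0 * k3 * (P0 - E) / k2) - 0.5 * b


def time_from_enzyme(E, P0=1.0, T0=1.0, k2=1.0, k3=1.0):
    """Solve P = P0 exp(k2 E / k3 - k2 T0 t) for t, with P taken from E."""
    P = pool_from_enzyme(E, P0, T0, k2, k3)
    return (k2 * E / k3 - math.log(P / P0)) / (k2 * T0)


def tabulate_t_of_E(values=(0.1, 0.3, 0.5, 0.7, 0.9)):
    """Pairs (t, E); reading them as E against t gives the desired curve."""
    return [(time_from_enzyme(E), E) for E in values]
```

Because the approximation assumes the complex is in a quasi-steady state, the pairs are not reliable at very small E, where the power series is the appropriate tool.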


II. Synthesis with Extended Lag 


Observance of lag in induced enzyme synthesis is commonplace 
and the length of lag may be highly variable (Monod, 1956; Cohn, 
1957; Pollock, 1959). While lag may be caused by many factors, 
the essential condition for controlled enzyme synthesis is the 
absence of limiting factors so that the effect of the variation of a 
single parameter may be studied. Pollock (1952) has studied 
penicillinase formation in a series of careful experiments and he 
concludes that induced penicillinase synthesis of B. cereus can 
be divided into the three following phases: (1) primary interaction 
between penicillin (inducer) and receptor site of the cell; (2) latent 
phase or specific lag phase during which the rate of penicillinase 
formation increases; (3) a linear or active phase during which the 
enzyme is formed at relatively constant rate. The primary phase 
does not depend on cell metabolism and seems to be a process of 
complex formation between a receptor and an inducer. Since 
penicillin is rapidly taken up by the cell, this seems to suggest 
that the reaction favors complex formation, but probably cannot be 
considered entirely irreversible. At present it is not certain what 
the latent phase represents in enzyme synthesis but these proc- 
esses seem to depend on cellular metabolism. This phase repre- 
sents an irreversible process. On the basis of Pollock’s con- 
sideration, a scheme can be constructed and an attempt made to 
compare experimental data and kinetic data derived on the basis 
of mathematical calculations. While Scheme I represents a system 
in which synthesis starts rapidly after the inducer complexes with 
the repressed template, this has to be modified to include a latent 
phase. Inclusion of a latent phase in Scheme I yields a modified 
scheme which has been indicated as Scheme II. 


Scheme II 
\[ T + S \underset{k_{-1}}{\overset{k_1}{\rightleftharpoons}} [TS^*] \xrightarrow{k_2} [TS]; \qquad [TS] + P \xrightarrow{k_3} [TS - P] \xrightarrow{k_4} [TS] + E \]


The following additions are made to the notation: a short term 
complex [TS*] is formed from the components T and S; the con- 
centration of [TS*] is indicated by B*. 
Since enzyme synthesis was studied in logarithmically growing 
cells, the pool can be considered to be constant and the methods 
of Section A rather than B apply. However, in this case, since Pollock 
also used small values of S, we may not assume S to be constant. 

The rate equations for this scheme are: 


\[
\begin{aligned}
dT/dt = dS/dt &= -k_1 T S + k_{-1} B^*, \\
dB^*/dt &= -dT/dt - k_2 B^*, \\
dB/dt &= k_2 B^* - k_3 B + k_4 C, \\
dC/dt &= k_3 B - k_4 C, \\
dE/dt &= k_4 C,
\end{aligned} \tag{17}
\]

where we have absorbed the constant value of P into k_3. These 
equations have two immediate integrals, viz., 

\[ T_0 - T = S_0 - S = B^* + B + C, \tag{18} \]


where S_0 and T_0 represent the respective initial values and all 
other concentrations are assumed to be zero initially. For the 
long time behavior, we set all the rates except that of the enzyme 
formation equal to zero, and we see that we have three cases: 

\[
\begin{aligned}
&\text{(a)}\ S_0 < T_0: && S \to 0,\ T \to T_0 - S_0,\ dE/dt \to k' S_0; \\
&\text{(b)}\ S_0 > T_0: && T \to 0,\ S \to S_0 - T_0,\ dE/dt \to k' T_0; \\
&\text{(c)}\ S_0 = T_0: && S = T \to 0;\ dE/dt \text{ does not quite become constant.}
\end{aligned} \tag{19}
\]

The constant k' introduced above is defined as k_3 k_4/(k_3 + k_4), and 
the behavior of dE/dt in case (c) is not apparent simply from the 
procedures outlined. 
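
The limiting rates in cases (a) and (b) amount to k' times the smaller of S_0 and T_0. A small helper makes this explicit; the function names are illustrative, and case (c) is deliberately left out of the claim because, as noted above, the approach to the limit is not settled by this argument.

```python
def k_prime(k3, k4):
    """Effective constant k' = k3 k4 / (k3 + k4) of the two-step cycle."""
    return k3 * k4 / (k3 + k4)


def limiting_rate(S0, T0, k3, k4):
    """Long-time dE/dt for S0 != T0: k' times the limiting component."""
    return k_prime(k3, k4) * min(S0, T0)
```

For example, with k_3 = k_4 the constant k' is half of either, so an excess of inducer over template gives a limiting rate of k_3 T_0 / 2.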

To gain a better insight into the general behavior of the system 
when the TS formation is rapid compared with the intermediate 
steps, it is convenient to use the integrals of the equations to 
define a quantity U as: 

\[ U = U_0 - (T_0 - T) = U_0 - (B^* + B + C), \tag{20} \]

so that we have: 

\[ TS = (T_0 + S_0) U + (U_0 - U)^2, \quad \text{with} \quad U_0 = T_0 S_0/(T_0 + S_0). \tag{21} \]


Now, if we may neglect (U_0 - U)^2 in this equation, we obtain a 
linear set which can be integrated in closed form, e.g., the equa- 
tion for U is: 

\[ d^2U/dt^2 + (K + k_{-1} + k_2)\, dU/dt + k_2 K U = 0, \tag{22} \]

where we have defined K = k_1 (S_0 + T_0). The integral of this 
equation is straightforward and has the form of: 

\[ U = U_1 e^{-\lambda_1 t} + U_2 e^{-\lambda_2 t}, \tag{23} \]

where the \lambda's are the two roots of \lambda^2 - (K + k_{-1} + k_2)\lambda + K k_2 = 0, 
and 

\[ U_1 = (K - \lambda_2) U_0/(\lambda_1 - \lambda_2), \qquad U_2 = (\lambda_1 - K) U_0/(\lambda_1 - \lambda_2). \]

In the limit where K is very large compared with k_2, these can be 
simplified in terms of k_- = k_{-1} and K_+ = K + k_- as: 

\[
\begin{aligned}
\lambda_1 &= K_+ (1 + k_- k_2/K_+^2), & \lambda_2 &= K k_2 (1 - k_- k_2/K_+^2)/K_+, \\
U_1 &= K U_0 (1 - 2 k_- k_2/K_+^2)/K_+, & U_2 &= k_- U_0 (1 + 2 K k_2/K_+^2)/K_+,
\end{aligned} \tag{24}
\]

so that we see that U consists of a rapidly decaying part and a 
slowly decaying part, the latter existing only because of the re- 
versibility of the first step in the process. Once U is found, the 
remainder of the terms may be found by direct integration, e.g., B* 
is given by: 

\[ B^* = K U_0 (e^{-\lambda_2 t} - e^{-\lambda_1 t})/(\lambda_1 - \lambda_2). \tag{25} \]
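
To see how good the large-K simplification is, one can compare the exact decay rates (taken here as the two positive numbers whose sum is K + k_{-1} + k_2 and whose product is K k_2) with the approximate expressions. This is a sketch with hypothetical function names; the sample values K = 105, k_{-1} = 5, k_2 = 1 are assumptions chosen only to make K large compared with k_2.

```python
import math


def exact_rates(K, km1, k2):
    """Decay rates with sum K + k_{-1} + k_2 and product K k_2."""
    s = K + km1 + k2
    lam1 = 0.5 * (s + math.sqrt(s * s - 4.0 * K * k2))
    lam2 = K * k2 / lam1           # avoids cancellation in the small root
    return lam1, lam2


def approximate_rates(K, km1, k2):
    """Large-K forms written with K_+ = K + k_{-1}."""
    Kp = K + km1
    eps = km1 * k2 / (Kp * Kp)
    return Kp * (1.0 + eps), K * k2 * (1.0 - eps) / Kp


def amplitudes(K, km1, k2, U0):
    """U_1 and U_2 from the exact rates; they sum to U_0."""
    lam1, lam2 = exact_rates(K, km1, k2)
    return ((K - lam2) * U0 / (lam1 - lam2),
            (lam1 - K) * U0 / (lam1 - lam2))
```

The fast rate is close to K + k_{-1} and the slow rate close to k_2 K/(K + k_{-1}); the slow amplitude U_2 vanishes when k_{-1} = 0, in line with the remark that the slowly decaying part exists only because the first step is reversible.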


Since B + C may be obtained directly from B* and U one may 
easily obtain an equation for C and integrate it and from that ob- 
tain an equation for B and also for dE/dt. This latter is, in terms 
of k = k_3 + k_4 and k' defined earlier, 

\[ dE/dt = k' U_0 \left[ 1 + \frac{K k_2 e^{-kt}}{(k - \lambda_2)(\lambda_1 - k)} - \frac{\lambda_1 k e^{-\lambda_2 t}}{(\lambda_1 - \lambda_2)(k - \lambda_2)} - \frac{\lambda_2 k e^{-\lambda_1 t}}{(\lambda_1 - \lambda_2)(\lambda_1 - k)} \right]. \tag{26} \]

The integral of this is readily found and is seen to contain a 
linear term, k'U_0 (t - t_0), and three transients. On the assumption 
that K is large whereas k_2 and k are small and about of equal 
magnitude, one obtains: 

\[ E = k' U_0 \left[ t - t_0 + \frac{k^2 e^{-\lambda_2 t} - \lambda_2^2 e^{-kt}}{k \lambda_2 (k - \lambda_2)} \right], \]

with: 

\[ t_0 = 1/k + 1/\lambda_2 = 1/k + (1/k_2)(1 + k_-/K), \tag{27} \]
where we have completely neglected the short term transient and 
terms in t_0 contributed by it (exactly = 1/K). We see that the re- 
versibility of the first step makes itself felt only in the lag time 
t_0 and not in the slope of the enzyme production curve. In Figure 
3 this slope, obtained from k'U_0 and from the infinite time limits 
considered earlier, and those obtained from reading curves ob- 
tained from an analogue computer programmed (Figure 5) for the 
exact non-linear equations, are displayed. In obtaining these 
curves an effort was made, by means of the theory outlined so far, 
to obtain a consistent set of units with which to describe Pollock’s 
works (1950, 1951, 1952). Figure 3 of Pollock’s paper (1950) can 
be considered to give the asymptotic enzyme rate for very low S_0 
and hence the slope of this rate as a function of S_0 should give k' 
directly. This is found to be 23,200 μl CO_2/unit/hr.^2. Presuma- 
bly the slopes of the E vs. t curves are too small since not enough 
time has elapsed after only one hour to adequately estimate them, 
but investigation of Figure 3 of this paper shows that, in the very 
small S_0 limit, very little error is introduced by inadequate waiting 
as the two limiting curves are almost coincident. Figure 6A of 
Pollock’s paper (1952) again gives the rate but now for very small 
T_0, which gives us k'T_0 as 660 μl CO_2/ml/hr.^2 when 0.05 mg. of cells 
were used. This implies that there are 0.57 units of T_0 in every 


[Figure 3: graph of rate of penicillinase production (ordinate) against PENICILLIN CONC. (Volts) (abscissa, 0 to 20). Legend: COMPUTER (solid line); LINEAR APPROXIMATION (broken line); POLLOCK (Fig. 3); POLLOCK (Fig. 6B); limiting curve labelled INFINITE TIME.] 


FIGURE 3. Rate of production of penicillinase as a function of peni- 
cillin concentration. Penicillin units per ml. are converted into volts 
for analogue computer programming. Conversion ratio: abscissa in volts, 
1 v = .057 units/ml.; ordinate in volts/sec. = .415 units/ml./hr. 
FIGURE 4. Penicillinase synthesis as function of time. Comparison 
between experimental points (Pollock, 1952) and analogue computer 
curve from Scheme II. 


mg. of cellular material. The values of the rate constants are not 
so easily obtained. One may estimate t_0 from many of the curves 
given by Pollock and find it between 0.20 and 0.25 hrs. However, 
the analytic expression obtained for t_0 in terms of the rate con- 
stants is probably valid only for the case of Figure 3 (t_0 = 0.21 
hr.) and in this case, one does not know enough about the relative 
values of the k_1, k_{-1}, etc., to obtain each of them from a single 
expression. Extrapolating the validity of our linear approximation 
to higher values of S_0, we may use Figure 6A (Pollock, 1952) and 
the assumption (of a heuristic nature) that k_2 = k_3 = k_4 to obtain 
k_2 = 7.2/hr., k' = 3.6/hr. This yields the conversion factor of 
1.55 x 10^-4 units of E producing 1 μl CO_2/hr. In the machine cal- 
culation T_0 was taken to be 5 v, which for 0.5 mg. of cellular mate- 
rial implies that one unit will be represented by 17.5 v. Also the 
constants k_2, etc., were set at 1/sec., so that 1 sec. on the ma- 
chine’s scale represented 8.4 min. or that 1 hr. on the experimental 
curves is represented by 7.2 sec. on the machine curves. Thus, for 
example, Figure 6B of Pollock (1950) gives a rate which, in Pollock’s 
units, is 5650; we may convert this to our standard base (all mass 
in units of Penicillin equivalent) as 0.875 units/hr. and for the 
purpose of comparison with the machine’s calculations as 2.12 
v/sec. This point is indicated in Figure 3 of this paper along 
with similar points taken from Pollock’s Figure 3 (1950). Un- 
fortunately, since Pollock did not use any values of S intermediate 
between 0.1 and 1.0, we can show no points in this region. 
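
The chain of unit conversions in this paragraph can be retraced in a few lines. The sketch below uses only figures quoted in the text; k' is computed from its definition k_3 k_4/(k_3 + k_4) under the stated heuristic assumption of equal constants, and the reading that the 0.05 mg. of cells refers to one ml. is an assumption made here. Variable and key names are illustrative.

```python
SLOPE_UL_CO2 = 23200.0    # k' in Pollock's units: microlitres CO2 / unit / hr^2
RATE_SMALL_T0 = 660.0     # k' T0: microlitres CO2 / ml / hr^2 at 0.05 mg cells
CELLS_MG = 0.05           # mg of cells (assumed to be per ml)
K_PER_HR = 7.2            # k2 = k3 = k4, per hour (heuristic assumption)
MACHINE_T0_VOLTS = 5.0    # T0 on the machine, representing 0.5 mg of cells
MACHINE_CELLS_MG = 0.5


def conversions(pollock_rate=5650.0):
    """Retrace the unit conversions; all outputs are rounded in the text."""
    k_prime = K_PER_HR * K_PER_HR / (K_PER_HR + K_PER_HR)        # per hr
    units_E_per_ul = k_prime / SLOPE_UL_CO2      # units of E per (ul CO2/hr)
    T0_per_mg = RATE_SMALL_T0 / SLOPE_UL_CO2 / CELLS_MG          # units / mg
    volts_per_unit = MACHINE_T0_VOLTS / (MACHINE_CELLS_MG * T0_per_mg)
    machine_sec_per_hr = K_PER_HR                # 7.2/hr was set to 1/sec
    rate_units_per_hr = pollock_rate * units_E_per_ul
    rate_volts_per_sec = rate_units_per_hr * volts_per_unit / machine_sec_per_hr
    return {
        "k_prime_per_hr": k_prime,
        "units_E_per_ul_CO2_per_hr": units_E_per_ul,
        "T0_units_per_mg": T0_per_mg,
        "volts_per_unit": volts_per_unit,
        "minutes_per_machine_sec": 60.0 / machine_sec_per_hr,
        "rate_units_per_hr": rate_units_per_hr,
        "rate_volts_per_sec": rate_volts_per_sec,
    }
```

Carrying full precision through the chain gives values that agree with the quoted ones to within rounding, which suggests the quoted figures were rounded at each step.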

In Fig. 4 we show one of the machine curves obtained with the 
constants: 


k_2 = k_3 = k_4 = 1/sec; k_{-1} = 5/sec; k_1 = 5/sec-v; 
T_0 = 5 v, S_0 = 16 v. 


On the curve are indicated, as experimental points, Pollock’s 
Figure 6B, which, as we have seen, would represent 17.5 v for S_0, 
but the difference is not considered significant since there is 


ANALOGUE COMPUTER CIRCUIT 
FOR SCHEME I 


FIGURE 5. Symbols used are those of Jackson (1960). 1. Sign changer; 
output voltage = negative of input. 2. Summer; output = negative of sum 
of inputs. 3. Scale factor; output = constant times input. 4. Multiplier; 
output voltage = product of input voltages, scale to be fixed. 5. Inte- 
grator; output = initial — integral of input from zero; time scale = RC. 
little change of shape with S_0 in this region. The very good fit 
of this data is the best confirmation of this approach since the 
scale factors, etc., which were used were obtained exclusively 
from Pollock’s other curves and independently of this one. Modi- 
fication of the machine values of k_1 and k_{-1} would also improve 
the fit at the lower points, but this is not considered worthwhile 
since more accurate experimental measurements would also be 
necessary. 
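
A digital counterpart of the analogue-computer run can be sketched with a fixed-step integrator. The rate equations are those of Scheme II with the pool absorbed into k_3; the default constants are one reading of the machine settings listed above (k_2 = k_3 = k_4 = 1/sec, k_{-1} = 5/sec, k_1 = 5/sec-v, T_0 = 5 v, S_0 = 16 v) and should be treated as assumptions, as should the function names and step count.

```python
def scheme2_rhs(y, k1, km1, k2, k3, k4):
    """Rates for Scheme II; y = [T, S, Bstar, B, C, E]."""
    T, S, Bs, B, C, E = y
    bind = k1 * T * S - km1 * Bs          # T + S <-> [TS*]
    return [-bind, -bind, bind - k2 * Bs,
            k2 * Bs - k3 * B + k4 * C, k3 * B - k4 * C, k4 * C]


def simulate_scheme2(t_end, T0=5.0, S0=16.0, k1=5.0, km1=5.0,
                     k2=1.0, k3=1.0, k4=1.0, n_steps=10000):
    """Runge-Kutta integration from T = T0, S = S0, all else zero."""
    f = lambda y: scheme2_rhs(y, k1, km1, k2, k3, k4)
    y, h = [T0, S0, 0.0, 0.0, 0.0, 0.0], t_end / n_steps
    for _ in range(n_steps):
        a = f(y)
        b = f([u + 0.5 * h * v for u, v in zip(y, a)])
        c = f([u + 0.5 * h * v for u, v in zip(y, b)])
        d = f([u + h * v for u, v in zip(y, c)])
        y = [u + h * (p + 2 * q + 2 * r + s) / 6
             for u, p, q, r, s in zip(y, a, b, c, d)]
    return y


def rate_and_lag(t_end=20.0, k4=1.0, **kwargs):
    """Final dE/dt and the apparent lag t - E / (dE/dt) at t_end."""
    T, S, Bs, B, C, E = simulate_scheme2(t_end, k4=k4, **kwargs)
    rate = k4 * C
    return rate, t_end - E / rate
```

With these settings the final rate settles at k' T_0, the limit expected when the inducer is in excess, and the apparent lag comes out as a positive time of the order of one to two machine seconds.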


Summary 


A hypothetical formulation of the synthesis of the dissociable 
enzyme has been proposed. Mathematical calculations have been 
carried out assuming various interrelations between the opera- 
tional parameters. 

One of the proposed schemes has been compared with experi- 
mental data which has been obtained by measurements of induced 
penicillinase synthesis of B. cereus. A good correlation between 
theoretical and experimental data is obtained. 


We wish to acknowledge the invaluable help of Prof. D. W. Howe 
in obtaining the machine results. 
Given three primary events, occurrences or nonoccurrences of two 
response classes and nonoccurrence of both response classes, with 
their respective probabilities to be known, and assuming the first two 
events are disjoint, this paper deduces the error equation of the incorrect 
response and the latency equation of the correct response under the con- 
dition of correlated reward situation. 


Suppose there are only two response classes under experimental 
control with the nonoccurrence of both response classes con- 
stituting the third event. Consequently, the probability of occur- 
rence of each response class at the beginning of the experiment 
proper is known. It will be assumed that the events associated 
with the two response classes and the third event are pairwise 
disjoint. This is a typical case of the instrumental or trial-and- 
error learning situation. 

Suppose further that pretraining has been given so that a re- 
warded response is more likely to occur than a response which is 
not rewarded. Now let the response class F,, with larger proba- 
bility of occurrence be extinguished by the withdrawal of reward 
and the response class PR, be strengthened by the presentation of 
reward. The problem considered in this paper is that of finding 
the reaction latency of the now correct response F,. By latency 
is meant the length of time elapsing from the onset of the stimulus 
to the onset of the associated reaction (Hull, 1943). In other 
words, it refers to the time interval between the moment of ex- 
posing an animal to the experimental situation and the evocation 
of the response under consideration. An exposure of an animal to 
an experimental situation means that the animal is in a situation 
such that it is ready and free to make the response to be recorded. 

Assume that the acquisition and the extinction processes of the
correct and incorrect responses are characterized respectively by
the following equations (Estes, 1950):

$$ p_n = 1 - e^{-dn}, \qquad q_m = e^{-dm}, $$

where n and m represent acquisition and extinction trials
respectively, and d is a constant. Thus, $p_n$ is the probability of
occurrence of the rewarded response exactly after n acquisition
trials, implying that the probability of occurrence of the response is
zero before the start of the acquisition trials. Likewise, $q_m$ is the
probability of occurrence of the nonrewarded response after m
extinction trials.

The notions of correct and incorrect responses are defined by a 
drive-shift function. The drive-shift function has a collection of 
response classes as its domain, and the range of the function 
consists of the amount of drive shift. Most empirical investiga- 
tions done so far center upon the simplest type of this function. 
That is, the range of the drive-shift function includes only two 
values, a fixed amount of drive shift and no shift at all. A re- 
sponse class is correct or incorrect insofar as the drive-shift 
function defined for this response class has a negative drive shift 
or no shift. A negative drive shift is usually understood as reward.

For the sake of simplicity, the present paper deals only with the 
simplest type of drive-shift function, i.e., a case involving two 
category responses. To attain mathematical rigor the third event 
of the nonoccurrence of both response classes will be introduced
in the sequel. Common observation shows that when a response
with a stronger tendency is weakened to a certain stage,
uncertainty on the part of the subject is manifested by no response 
for a very short time interval. For the reason of convenience this 
time interval is assumed to be so small that it does not affect the 
latency equation to be derived. 

With the foregoing condition of pretraining, on the first trial, the
incorrect response class $R_w$ is likely to occur, or, more precisely,
the probability of occurrence of $R_w$ is larger than that of the now
correct response $R_c$. With the evocation of $R_w$, it undergoes
extinction, i.e., the probability associated with the occurrence of
$R_w$ decreases. If the decreased probability of occurrence of $R_w$ is
still larger than that of $R_c$, $R_w$ is evoked on the second trial.
The weakening process of $R_w$ continues until the probability of
occurrence of $R_w$ is equal to or less than that of $R_c$. When the
probability of occurrence of $R_w$ is decreased, that of $R_c$ is
assumed to be unchanged. This is a case of the stochastically
independent events associated with the correct and incorrect response
classes. For the sake of simplicity the present analysis is restricted
to this case. Then, in this case, the probability of occurrence of the
third event, nonoccurrence of both response classes, is increased
accordingly.

The argument requires the use of seemingly conflicting hypothe- 
ses: (a) that events have associated probabilities; (b) that actual 
events are all or none with respect to a probability scale. The 
difficulty disappears if the probability of an event under the 
second hypothesis is understood as expectation. For this purpose, 
define an indicator function $I_A$ by assigning the value one if a
point falls on the event A, and zero otherwise. In other words, if
the event A happens, the indicator function has the value one, and
zero if the event A does not occur. Since $I_A$ is a random variable,
the expectation $E I_A$ is calculated to yield the probability of A.
Therefore $PA_w$ and $PA_c$ can be compared on the basis of
understanding them as indices of expectation, the subscripts w and c
referring to incorrect and correct responses. When $PA_w$ becomes
equal to or less than $PA_c$, the calculated $n_i$, to be explained
later, can be regarded as the expected number of incorrect responses,
and the point at which $PA_w$ is equal to $PA_c$ may be considered as
the first expected occurrence of the correct response on the ith
occasion.

With the subsequent occurrence of $R_c$ followed by reward, the
probability of occurrence of $R_w$ increases by the known process of
generalization or spontaneous recovery, depending upon the nature
of the experimental setup. Suppose the increased probability of
occurrence of $R_w$ due to the reward to $R_c$ is a fraction of the
diminished part of the probability of occurrence of $R_w$ which has
resulted from a series of its extinction trials. That fraction will
be indicated by $\theta$. In other words, suppose the probability of
occurrence of $R_w$ at the beginning of the trials is $e^{-dn_0}$ and
that of $R_c$ is $1 - e^{-dn'}$, so that $e^{-dn_0} > 1 - e^{-dn'}$.
Since $R_w$ was previously correct, $n_0$ should be obtained from the
equation [...] with $n'$ known. Then after the first appearance of
$R_c$ the probability [...] since the probability of occurrence of
$R_w$ decreases down to the level of $1 - e^{-dn'}$ before the
occurrence of $R_c$. Therefore the diminished part of the probability
of occurrence of $R_w$ is $e^{-dn_0} - (1 - e^{-dn'})$. Let this
quantity multiplied by $\theta$ be denoted by $b_1$. More generally,
let $b_i$ indicate the added probability of $R_w$ induced by the ith
occurrence of $R_c$. The calculation of $b_2$ can be visualized from
Figure 1. From the left to the right of Figure 1, there is a series of
descending curves, each of which describes an extinction process of
$R_w$. The points of discontinuity in the descending curves are caused
by the generalization or spontaneous recovery effect of reward to
$R_c$.

[FIGURE 1. Calculation of error equation. Probability plotted against
trials, showing the descending curves and the ascending curve.]

The ascending curve manifests the simple acquisition process of $R_c$.
After the first point of discontinuity, the second extinction curve
comes down until it reaches the point $1 - e^{-d(n'+1)}$, which is the
probability of occurrence of $R_c$ at that moment. Note that $R_c$ has
just occurred once. Therefore the net decrease of the second
descending curve is

$$ b_1 - [(1 - e^{-d(n'+1)}) - (1 - e^{-dn'})]. $$

Thus, $b_1, b_2, \ldots$ can be written out in the following manner:

$$ b_1 = [e^{-dn_0} - (1 - e^{-dn'})]\,\theta, $$
$$ b_2 = [b_1 - (r_1 - r_0)]\,\theta, $$
$$ b_i = [b_{i-1} - (r_{i-1} - r_{i-2})]\,\theta, \quad \text{etc.}, $$

where

$$ r_{i-1} = 1 - e^{-d(n'+i-1)} \quad \text{and} \quad r_{i-1} - r_{i-2} = e^{-d(n'+i-1)}(e^{d} - 1). $$

By recurrent substitution,

$$ b_i = (e^{-dn_0} + e^{-dn'} - 1)\,\theta^{i} - e^{-dn'}(e^{d} - 1)\,[\theta^{i-1}e^{-d} + \theta^{i-2}e^{-2d} + \cdots + \theta e^{-(i-1)d}]. $$

The quantity in the last bracket is a term of a special type of
convolution of two sequences, $\{\theta^i\}$ and $\{e^{-di}\}$. Since
$e^{-d}$ is always less than one for any positive d, and $\theta$ is
fairly large, if $e^{-d}$ is approximated by $\theta^k$ with k being a
positive integer, the two sequences will have the same variable.
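
The recurrence for the added probabilities can be checked numerically. The following is only an illustrative sketch: it assumes the exponential forms used in the text (the correct response at $1 - e^{-d(n'+j)}$ after j rewarded occurrences, the incorrect response starting at $e^{-dn_0}$), and the parameter values in the usage note are hypothetical, chosen only so that $b_1$ is positive.

```python
import math

def r(j, d, n_prime):
    """Probability of the correct response after j rewarded occurrences."""
    return 1.0 - math.exp(-d * (n_prime + j))

def b_recursive(i, d, theta, n0, n_prime):
    """b_i from b_1 = theta*[e^{-d n0} - r_0] and
    b_j = theta*[b_{j-1} - (r_{j-1} - r_{j-2})]."""
    b = theta * (math.exp(-d * n0) - r(0, d, n_prime))
    for j in range(2, i + 1):
        b = theta * (b - (r(j - 1, d, n_prime) - r(j - 2, d, n_prime)))
    return b

def b_closed(i, d, theta, n0, n_prime):
    """Closed form obtained by substituting the recurrence into itself."""
    head = (math.exp(-d * n0) + math.exp(-d * n_prime) - 1.0) * theta ** i
    # Convolution-type sum of the sequences {theta^j} and {e^{-d j}}.
    conv = sum(theta ** (i - j) * math.exp(-d * j) for j in range(1, i))
    return head - (math.exp(d) - 1.0) * math.exp(-d * n_prime) * conv
```

With the hypothetical values d = 0.1, theta = 0.5, n0 = 1, n_prime = 2, the two functions agree to rounding error for the first several i, which is a quick consistency check on the substitution.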

Now the first task is to find the number of extinguished trials
between any two consecutive evocations of the correct responses,
i.e., the number of trials covered in each descending curve of
Figure 1. Let the number of trials covered in the first descending
curve be denoted by $n_1$, etc. Note that n here is a function
defined on a set of positive integers such that $n_i$ indicates a
functional value. The i in $n_i$ refers to the ith descending curve.
Also note that the i in $n_i$ is the same as the i in the ith
occurrence of the correct response $R_c$. Figure 1 illustrates the
manner in which the descending and ascending curves are equated for
the calculation of the number of unrewarded trials, $n_i$, occurring
in each uninterrupted sequence between two correct responses. For this
purpose, raise the starting point of each descending curve to unity.
The amount raised will be denoted by $a_i$. The i in $a_i$ is counted
from the second descending curve. Therefore it is the same as the [...]

[...]

where [...]. Therefore,

$$ e^{-dn_i} = 1 - b_i, $$

and

$$ n_i = \log (1 - b_i)^{-1/d}. \qquad (1) $$


Equation (1) represents a decreasing error curve with respect to
$R_w$. One advantage in the above derivation is its discrete
character. The actual error curve can be drawn by assuming a certain
value for the recovery coefficient $\theta$. The recovery coefficient
on the other hand is a function of the experimental conditions
influencing the degree of generalization or spontaneous recovery.
After an evocation of correct response, if the experiment is
interrupted for a while, spontaneous recovery will play a greater role
in determining the recovery coefficient. On the other hand, if the
experiment is carried out without interruption, i.e., the trials
continue immediately after an emission of correct response, the
generalization factor will be more important. The actual determination
of $\theta$ as a function of these two factors should be done by
empirical investigation.

Once the equation of error is deduced, the latency equation is
relatively simple. Assuming $R_w$ and $R_c$ have the same mean
duration time A, the latency of the ith correct response $t_i$ can be
expressed in the following equation.

$$ t_i = A(n_i + 1) = A\left[1 - \frac{1}{d}\log (1 - b_i)\right]. \qquad (2) $$


Consequently, the connections have been established among three 
behavioral measures, probability of occurrence, error, and latency. 
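
The link between error count and latency can be illustrated with a short sketch. It reads equation (1) as $n_i = -(1/d)\log(1 - b_i)$ and equation (2) as the mean response duration times $(n_i + 1)$. The function names and the argument `duration` are hypothetical, introduced here only for illustration.

```python
import math

def errors_before_correct(b_i, d):
    """Equation (1), read as n_i = -(1/d) * log(1 - b_i)."""
    return -math.log(1.0 - b_i) / d

def latency(b_i, d, duration):
    """Equation (2): n_i incorrect responses plus one correct response,
    each assumed to last `duration` on average."""
    return duration * (errors_before_correct(b_i, d) + 1.0)
```

When $b_i$ is zero, no incorrect responses intervene and the latency reduces to a single response duration. As $b_i$ grows toward one, the latency grows without bound.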

In a correlated reward situation, i.e., one in which the occurrence of
$R_w$ is followed by the withdrawal of reward and that of $R_c$ is
followed by reward, the latency of $R_c$ can be derived from the
probabilities of occurrence of $R_w$ and $R_c$. This correlated reward
assumption is crucial in linking probability and latency measures.

It is now time to gather corollaries. The procedure is to relax or
eliminate some of the conditions and to see the consequences.


First, the homogeneity assumption of $R_w$ and $R_c$ can be dropped.
The duration of $R_w$ can be different from that of $R_c$. The result is
just a minor modification in the latency equation. Second, the 
equations of acquisition and extinction processes can be arbitrary. 
The idea inherent in the derivation remains the same. 


Most of this paper was completed during the author’s study at 
the University of Illinois as a graduate fellow. The author is also 
grateful for Dr. H. D. Landahl’s valuable comments.

Freese’s Hypothesis states that a single specific alteration in the
sequence of nucleotides of an information-bearing DNA molecule results
in a specific mutational effect. Within the framework of the DNA-protein 
coding problem developed elsewhere, and assuming the quasi-ergodicity 
of the general coding process, it is shown that Freese’s Hypothesis 
allows us to derive expressions for the length of the smallest mutable 
DNA molecule and to obtain a bound for the maximal number of allelic 
molecules of fixed length. To illustrate these ideas, calculations are 
carried out on appropriate data from bacteriophage and man, and the
results are shown to differ by a factor of 10 (modulo the rather crude
approximations used). It is further shown that, if $p(N)$ and
$\theta(N)$ are respectively the number of information-bearing words of
length N in a given code and the number of words of length N, then the
number $\lim_{N \to \infty} p(N)/\theta(N)$ depends sensitively on the
parameter $\epsilon$ which specifies the given code. The implications
of this result for the spontaneous aggregation of a sufficient number
of information-bearing words to characterize an organism are
discussed.


I. Introduction. Recent work of Freese (1959a, 1959b) has 
shown that many mutational events in bacteriophage can be in- 
terpreted as implying that an alteration of a single nucleotide pair 
in the phage DNA is responsible for a specific change in the 
activity of the genetic material. We shall henceforth refer to this 
interpretation of the experimental facts as Freese’s Hypothesis. 
It is the purpose of the present note to investigate certain impli- 
cations of this hypothesis for the general problem of DNA-protein 
coding, utilizing the framework for the coding problem which we
have developed elsewhere (Rosen, 1959a, 1959b, 1960). We shall
assume that the reader is familiar with this work. 

In greater detail, Section II is devoted to a discussion of the
relation between the structure of a given code and the possibility 
that a substitution occurring in a word of the code will again yield 
a word in the code. We are led in this manner to certain predic- 
tions which can in principle be verified experimentally, thereby 
rendering our formulation of the coding problem (and in particular our 
hypothesis of ergodicity) susceptible to direct empirical tests. 
Section III will serve as a mathematical appendix, in which the 
discussion of Section II is employed to derive some further results 
of interest in the present context. The mathematical methods em- 
ployed in the present note are of a different character than those 
utilized heretofore, being of a predominately combinatorial and 
number-theoretic rather than of an abstract algebraic nature. 


II. Implications of Freese’s Hypothesis for the Coding Problem.
It will be recalled that in our previous work (Rosen, op. cit.) we
formulated the DNA-protein coding problem in the following manner:
each admissible code is obtained by specifying (a) a monomorphism
$f: M_{20} \to M_4$, where $M_4$, $M_{20}$ represent the free monoids
generated by sets containing four and twenty elements respectively,
and (b) a definite quasi-ergodic submonoid $E(q_i, \eta) \subset M_{20}$
(we recall that $E(q_i, \eta)$ is specified by choosing numbers $q_i$,
$i = 1, \ldots, 20$, such that $0 < q_i < 1$ and $\sum q_i = 1$, and a
number $\eta > 0$; $E(q_i, \eta)$ is then the set of all words w in
$M_{20}$ such that $|\lambda_i(w)/\lambda(w) - q_i| < \eta$,
$i = 1, \ldots, 20$). The monoids $M_4$, $M_{20}$ represent the totality
of possible DNA molecules and polypeptide chains respectively, and
it is assumed that a necessary condition for a polypeptide chain w
to be codable in the given code is that w shall belong to
$E(q_i, \eta)$. The word f(w) is taken to correspond to the DNA
molecule which codes w, and we have shown that the submonoid
$f(E(q_i, \eta)) \subset M_4$ is quasi-ergodic in $M_4$; we may
therefore denote this submonoid, which corresponds to the totality of
DNA molecules that carry genetic information under the given code, as
$E(p_i, \epsilon)$, where $p_i$, $\epsilon$ have meanings analogous to
$q_i$, $\eta$, and now $i = 1, \ldots, 4$.

Let us suppose that w is an element of $E(p_i, \epsilon)$ and construct
a new word w’ from w by replacing one of the “letters” of w (i.e.,
one of the generators of $M_4$ which appears in w) by a different
“letter.” We ask what conditions on the parameters w, $p_i$, $\epsilon$
are necessary and sufficient to ensure that w’ will belong to
$E(p_i, \epsilon)$. Let us first notice that if w is an element of
$E(p_i, \epsilon)$, then any permutation of the letters of w again
yields a word in $E(p_i, \epsilon)$, the length of which is the same as
that of the given word w. We shall say that two words which differ by
a permutation (i.e., which differ only in sequence and not in
composition) are isomeric. We now observe that the replacement of a
single letter of a word w by a different letter results in a word w’
of the same length as w, but which is not isomeric to w. We may
therefore reformulate our question in the following more specific
manner: how many non-isomeric words of length N occur in the
quasi-ergodic submonoid $E(p_i, \epsilon)$? We shall for the moment not
restrict $E(p_i, \epsilon)$ to being a submonoid of $M_4$, but shall
work in the general free monoid $M_n$.

We may gain some insight into these questions by investigating
their solutions in the limiting case of strict ergodicity (Rosen,
1959a); i.e., $\epsilon = 0$. In this situation, it will be readily
verified that the following result holds: given an integer N, there is
at most one word w of length N which, together with all its isomers,
lies in $E(p_i, 0)$; moreover, in order for such a word to exist, it is
necessary and sufficient that N = kp, where k is a positive integer,
and p is the least common denominator of the (rational) numbers $p_i$.
Therefore, since no pair of non-isomeric words of the same length can
belong to a strictly ergodic monoid $E(p_i, 0)$, we can see at once
that Freese’s Hypothesis is equivalent to asserting the non-existence
of any strictly ergodic DNA-protein code.
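
The strictly ergodic case can be made concrete with a small sketch. It assumes the letter frequencies are supplied as exact rationals that sum to one. The function name and the example frequencies in the usage note are hypothetical.

```python
from fractions import Fraction
from math import lcm

def strict_word_composition(freqs, N):
    """Letter counts N*p_i of the unique composition of length N when
    epsilon = 0, or None if N is not a multiple of the least common
    denominator p of the frequencies."""
    p = lcm(*(f.denominator for f in freqs))
    if N % p != 0:
        return None
    return [int(f * N) for f in freqs]
```

For the hypothetical frequencies 1/2, 1/4, 1/8, 1/8 the least common denominator is 8, so a word of length 16 must have composition [8, 4, 2, 2] while no word of length 12 exists. In either case there is never a second, non-isomeric composition of the same length, which is the point of the argument above.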

In the case of a quasi-ergodic code $E(p_i, \epsilon)$, the situation
is different. It is not difficult to see qualitatively that the
following relations hold in a quasi-ergodic submonoid: (a) the greater
the value of $\lambda(w)$, the greater is the chance that a word w’
obtained from w by a single substitution will again be in the
submonoid; (b) there is a minimal length $N_0$ such that, whenever
$\lambda(w) < N_0$, a single substitution will always result in a word
w’ which no longer lies in the submonoid. In genetic terms, we can see
that one of the advantages of utilizing a quasi-ergodic code lies in
the fact that there is a buffer (depending on the magnitude of
$\epsilon$) between accidental alterations in sequence and complete
loss of information-bearing capacity. The statements (a) and (b)
merely assert that this tolerance increases with the length of the
word w, and that there is a certain minimal length below which the
tolerance becomes zero. We now proceed to make these remarks more
precise.

Let us now derive a simple criterion for determining whether or
not, given a quasi-ergodic submonoid $E(p_i, \epsilon) \subset M_n$ and
an integer N, there exist words of length N in $E(p_i, \epsilon)$. That
is, does there exist a word w such that $\lambda(w) = N$, and

$$ |\lambda_i(w) - Np_i| < \epsilon N, \quad i = 1, \ldots, n? \qquad (1) $$

If we interpret the relations (1) geometrically, we see that, for
each i, the integer $\lambda_i(w)$ is restricted to lie within the open
interval, the midpoint of which is $Np_i$ and the length of which is
$2N\epsilon$; i.e., $\lambda_i(w)$ must lie in the interval

$$ (Np_i - N\epsilon, \; Np_i + N\epsilon). \qquad (2) $$

Clearly, a word w satisfying (1) will exist if and only if the
interval (2) contains an integer for each i. This is the desired
criterion.
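
The criterion as stated can be sketched in a few lines. The sketch checks each open interval separately, exactly as the text describes, and uses exact rationals to avoid rounding at the interval ends. The function names and the example frequencies are hypothetical.

```python
import math
from fractions import Fraction

def admissible_counts(N, p_i, eps):
    """Integers strictly inside the open interval (N*p_i - N*eps, N*p_i + N*eps)."""
    lo, hi = N * (p_i - eps), N * (p_i + eps)
    first = math.floor(lo) + 1   # smallest integer greater than lo
    last = math.ceil(hi) - 1     # largest integer less than hi
    return list(range(first, last + 1))

def passes_criterion(N, freqs, eps):
    """True when every interval of the form (2) contains an integer."""
    return all(admissible_counts(N, p, eps) for p in freqs)
```

With hypothetical frequencies 3/10, 3/10, 1/5, 1/5 and eps = 1/10, the length N = 4 passes, whereas N = 3 fails because the interval (0.3, 0.9) for the letters of frequency 1/5 contains no integer.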

We can sharpen the above argument considerably. Let us denote
the set of integers lying within the interval (2) by $Z_i$; we then
obtain the result that the number of non-isomeric words of length N
which satisfy (1) is given by the number of different combinations
of integers which satisfy the relation

$$ \sum_{i=1}^{n} n_i = N, \quad n_i \in Z_i. \qquad (3) $$

In fact, for each sum of the form (3) which can be constructed, we
obtain a corresponding word w by putting $n_i = \lambda_i(w)$.
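
Counting the solutions of relation (3) is easy to do by brute force for small alphabets. The sketch below enumerates every choice of one integer from each set $Z_i$ and keeps those summing to N. The function name and the two-letter example are hypothetical, and the enumeration is exponential in the alphabet size, so it is meant only as an illustration.

```python
import math
from fractions import Fraction
from itertools import product

def compositions(N, freqs, eps):
    """All tuples (n_1, ..., n_n) with n_i strictly inside
    (N*p_i - N*eps, N*p_i + N*eps) and n_1 + ... + n_n = N."""
    ranges = []
    for p in freqs:
        lo, hi = N * (p - eps), N * (p + eps)
        first = max(math.floor(lo) + 1, 0)          # letter counts cannot be negative
        ranges.append(range(first, math.ceil(hi)))  # integers below hi
    return [c for c in product(*ranges) if sum(c) == N]
```

For two letters of frequency 1/2 and eps = 1/4, length 4 admits only the composition (2, 2), while length 8 admits (3, 5), (4, 4) and (5, 3). Each tuple stands for one class of mutually isomeric words.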

We can now show that there exists an integer $\bar{N}$ such that for
each integer $N > \bar{N}$ there exists a word w with the properties
that $\lambda(w) = N$ and moreover w lies in $E(p_i, \epsilon)$. That
is, a quasi-ergodic submonoid contains words of length N for almost all
values of N. In fact, we can write down a condition for determining
the number of non-isomeric words of a given length which lie in a
given quasi-ergodic submonoid, as follows: let us first suppose that
the given integer N has the property that $Np_i$ is not an integer for
any i, $i = 1, 2, \ldots, n$. Then, for each i, there is an integer
lying within the interval $(Np_i - \tfrac{1}{2}, Np_i + \tfrac{1}{2})$.
Comparing this with (2), we find that the interval (2) contains an
integer only if one of the inequalities,

$$ Np_i - N\epsilon < Np_i - \tfrac{1}{2} \quad \text{or} \quad Np_i + N\epsilon > Np_i + \tfrac{1}{2}, $$

holds. Either of these inequalities yields the condition

$$ N > 1/(2\epsilon) \quad \text{or} \quad \epsilon > 1/(2N), \qquad (4) $$

which, it will be observed, is independent of the index i.
Conversely, if (4) holds, there will be a word of length N in
$E(p_i, \epsilon)$. Hence, (4) is necessary and sufficient to ensure
the existence of a word of length N in $E(p_i, \epsilon)$.

Under the same hypotheses, if we wish there to be two non-isomeric
words of length N in $E(p_i, \epsilon)$, we find that, by the same
reasoning as used above, we must have one of the inequalities,

$$ Np_i - N\epsilon < Np_i - 1 \quad \text{or} \quad Np_i + N\epsilon > Np_i + 1, $$

satisfied, which yields the condition

$$ \epsilon > 1/N. $$

In general, in order for there to exist k non-isomeric words of
length N in $E(p_i, \epsilon)$, it is necessary and sufficient for the
inequalities

$$ N > k/(2\epsilon) \quad \text{or} \quad \epsilon > k/(2N) \qquad (5) $$

to hold.

Similar arguments may be applied in case $Np_i$ is an integer for
some i, with but minor modifications. The discussion is left to
the reader.

The inequality (4) and the more general inequality (5) make it
possible to discuss Freese’s Hypothesis in a qualitative manner
from the standpoint of the general DNA-protein coding problem.
Since Freese’s Hypothesis requires the existence of two or more
non-isomeric words of the same length in $E(p_i, \epsilon)$, we find
this hypothesis placing restrictions upon the parameters which
determine the code, and this in turn affects the genetic behavior of
the code. For example, if there are to be a fixed number k of
non-isomeric (i.e., mutant or allelic) words of a given length in the
code, then (5) implies the existence of a lower bound $N_0$ such that
for words of length smaller than $N_0$ no mutation by simple
substitution can occur. We shall see that this type of prediction can
in principle be checked experimentally.

As an example of how Freese’s Hypothesis places restrictions
on the structure of a hypothetical quasi-ergodic DNA-protein code,
let us investigate in greater detail the implications of the above
discussion for a particular organism. Ideally, of course, such a
discussion requires that the structure of each information-carrying
DNA molecule and its corresponding polypeptide be known, but
this type of information is almost completely lacking. What
information does exist is spread out over a number of widely divergent
organisms, and there is no guarantee that the same code operates
throughout the biological world (cf. Doty et al., 1959).

The most satisfactory data available at present are those derived
from the fine-structure genetic analysis of the rII mutants of the
T4 bacteriophage (Benzer 1955, 1957; Benzer & Freese 1958).
Although the phage is a highly atypical organism, there does not
appear to be any reason to suppose that the implementation of the
genetic information of the phage is carried out by a mechanism
which differs in principle from those utilized by higher organisms.
In fact, since Freese’s Hypothesis itself was put forward on the
basis of data obtained from experiments with bacteriophage, this
assumption is implicitly required for the universal validity of the
Hypothesis.

It has been shown that the rII region of the phage genome
consists of two portions, or cistrons, which are taken to comprise
functional units. On the basis of the definition of these units (cf.
Benzer, loc. cit.), it is reasonable to suppose provisionally that
each cistron actually corresponds to a word of the DNA-protein
code employed by the phage. That is, the segments of the phage
genome which correspond to the A and B cistrons of the rII region
shall be assumed to be words in the quasi-ergodic submonoid
$E(p_i, \epsilon)$ corresponding to the code utilized by the phage.

It is known (Benzer, 1958) that the lengths of the A and B cistrons
are approximately 4 per cent and 2 per cent recombination units
respectively. The total length of the phage genome, which is supposed
to comprise a single linkage group, is of the order of 800 per cent
recombination units. It is further generally assumed that the phage
genome is a single particle of DNA, whose length in nucleotide pairs
is approximately $8 \times 10^4$ (cf., however, Sinsheimer, 1960,
p. 193 et seq.). Assuming a uniform relationship between length as
expressed in recombination units and as expressed in nucleotide pairs,
we can infer from these data that the lengths of the A and B cistrons
are approximately 400 and 200 nucleotide pairs respectively.


Thus far (cf. Freese, 1959a) a total of 65 mutants of the A cistron
and 42 mutants of the B cistron have been discovered. According to
Freese’s Hypothesis, each of these mutants has arisen by a single
substitution in the words $w_A$, $w_B$, corresponding to the
wild-type A and B cistrons respectively. If we take the lengths
of these words in nucleotide pairs, then it follows from (5) that we
must have

$$ \epsilon > \frac{65}{8 \times 10^2} \approx 0.08 \quad \text{and} \quad \epsilon > \frac{42}{4 \times 10^2} \approx 0.105 $$

for the A and B cistrons respectively. For simplicity, let us take
the value $\epsilon \approx 0.1$ (there being no reason to pick a
larger value of $\epsilon$ than absolutely necessary).
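
The arithmetic behind these bounds can be retraced with a short sketch. It uses only the figures quoted in the text (800 per cent recombination units for about 8 x 10^4 nucleotide pairs, 65 and 42 mutants). The function names are hypothetical.

```python
def nucleotide_pairs(recomb_units, genome_units=800, genome_pairs=80_000):
    """Convert recombination units to nucleotide pairs, assuming the
    uniform relationship between the two measures stated in the text."""
    return recomb_units * genome_pairs / genome_units

def epsilon_lower_bound(k, N):
    """Inequality (5): k non-isomeric words of length N require eps > k/(2N)."""
    return k / (2 * N)

N_A = nucleotide_pairs(4)              # A cistron, about 400 pairs
N_B = nucleotide_pairs(2)              # B cistron, about 200 pairs
eps_A = epsilon_lower_bound(65, N_A)   # about 0.081
eps_B = epsilon_lower_bound(42, N_B)   # 0.105
```

The B cistron gives the more demanding of the two bounds, which is why a working value of roughly 0.1 is adopted.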

The smallest word in the code which admits a substitution is,
according to (5) with k = 2, characterized by the smallest $N_0$ such
that $N_0 > 1/\epsilon$. Substituting the value $\epsilon = 0.1$, we
find that $N_0 \approx 10$. Therefore, since there must be at least
three nucleotide pairs per amino acid in the code (cf. Crick et al.,
1957), we arrive at the conclusion that the smallest polypeptide which
can be produced in mutant form in the code employed by the phage is
actually a tripeptide. However, this calculation is based on the
assumption that the length of a DNA molecule should be measured in
nucleotide pairs. If we suppose that each individual nucleotide should
be counted (as in the original scheme of Gamow, 1954), then the
total length of the phage genome becomes $1.6 \times 10^5$ and
$\epsilon$ is of the order of 0.05. In this case the smallest
polypeptide which can mutate in the phage code is a hexapeptide.
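
The two estimates can be reproduced with a brief sketch. It takes the threshold for a substitution to be 1/eps and assumes three coding units (nucleotide pairs, or single nucleotides under the alternative count) per amino acid. The function name is hypothetical, and the truncation simply mirrors the order-of-magnitude rounding used in the text.

```python
def smallest_mutable(eps, units_per_residue=3):
    """Return (threshold length 1/eps, approximate peptide length in residues)."""
    threshold = 1.0 / eps
    # Truncate rather than round up, following the text's rough estimate.
    return threshold, int(threshold / units_per_residue)

pairs_threshold, pairs_peptide = smallest_mutable(0.1)     # about 10 units, 3 residues
single_threshold, single_peptide = smallest_mutable(0.05)  # about 20 units, 6 residues
```

Halving eps doubles the threshold length, which is how the tripeptide estimate becomes a hexapeptide when individual nucleotides are counted.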

It might be of interest to investigate the question of whether 
small polypeptides are actually produced by the metabolic activity 
of the phage, and, if so, to investigate their sequence. If the hy- 
pothesis of ergodicity (or quasi-ergodicity) is correct, then such 
an analysis might lead to a concrete understanding of at least a 
portion of the phage code, according to the principles we have 
described previously (Rosen 1959a, 1959b). 

Let us now attempt to compare the information which has been 
derived above from data on bacteriophage with that available from 
higher organisms. Perhaps the data most suitable for this purpose 
are those obtained in the intensive study of the chemistry and 
genetics of human hemoglobin (cf. Sutton, 1960, for an informative 
discussion and further references). It is known that normal
hemoglobin (Hemoglobin A) is composed of two identical half-molecules,
each of which is composed of two dissimilar peptide chains,
labeled α and β. Thus far, approximately 20 mutant varieties of
hemoglobin have been discovered (apart from foetal hemoglobin
and its variants, which are not of interest in the present context).
Some of these abnormal hemoglobins have been shown to possess
altered amino-acid sequences on the α-chain (Hemoglobins I,
Hopkins-2); certain others (Hemoglobins C, E, S) possess altered
β-chains. Still other mutants (Hemoglobin D) have not as yet been
unambiguously characterized, and one variety (Hemoglobin H) has
been found to consist of four β-chains. Our present concern is, of
course, only with those mutants showing an alteration of amino-acid
sequence in either the α- or the β-chain. For simplicity, we
shall suppose that there are 10 mutants with alterations on the
α-chain and a similar number with alterations on the β-chain.

We may suppose (although this point has not yet been settled)
that the α- and β-chains are specified by different words of DNA.
It is known that both the α- and the β-chain have a length of
approximately 140 amino acid residues and, if we make the assumption
that the DNA-protein code in man involves three nucleotide
pairs per amino acid, it follows that the corresponding DNA words
have a length of at least 420 nucleotide pairs each. With these
assumptions, it follows from (5) that we have
$\epsilon > 10/(2 \cdot 420) \approx 0.01$. (It will be noticed that
this value is approximately 10 times smaller than that obtained above
for bacteriophage.) Moreover, we find from (4) that the smallest DNA
word capable of retaining genetic information under a substitution, in
the DNA-protein code of man, is of the order of
$1/(2 \cdot 0.01) = 50$ nucleotide pairs. Once again assuming three
nucleotide pairs per amino acid, we find that the length of the
smallest mutable peptide in the human code is approximately 17 amino
acid residues.
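
The same chain of estimates for hemoglobin can be followed step by step. The sketch uses the figures given in the text (10 mutants per chain, 140 residues, three nucleotide pairs per amino acid) and, like the text, rounds the bound to 0.01 before computing the minimal word length. The variable names are hypothetical.

```python
def epsilon_lower_bound(k, N):
    """Inequality (5): eps > k / (2N)."""
    return k / (2 * N)

residues = 140
word_length = 3 * residues                        # 420 nucleotide pairs
eps_bound = epsilon_lower_bound(10, word_length)  # about 0.0119
eps_used = 0.01                                   # rounded value used in the text
min_word = 1 / (2 * eps_used)                     # about 50 nucleotide pairs
min_peptide = min_word / 3                        # about 16.7 residues
```

Using the unrounded bound of about 0.0119 instead would give a minimal word of roughly 42 nucleotide pairs, so the figure of 17 residues should be read as an order-of-magnitude estimate.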


Small peptides of this order of magnitude are known to play
important roles in human metabolism. In fact, peptides of
approximately half the length of the “smallest” mutable peptide,
especially the pituitary hormones oxytocin and vasopressin, play an
essential role in physiological regulation. As far as the author is
aware, no mutant forms of these hormones have thus far been
discovered; we have in fact just shown that if Freese’s Hypothesis
and the ergodicity of the code are correct, and if these peptide
hormones are directly coded from DNA molecules, then no mutant
forms of these hormones can exist. Of course in this case, we
cannot discount the possibility that these small peptides are not 
coded de novo, but are enzymatically cleaved from a larger protein 
moiety (cf. Pickford, 1960). Nevertheless, it should be clear from 
the above, and from our discussion of bacteriophage, how a direct 
experimental test of Freese’s Hypothesis and the ergodicity of the 
code can in principle be constructed. 

To sum up, we have seen that Freese’s Hypothesis, together 
with the assumption that the DNA-protein coding process is quasi- 
ergodic, leads to a number of empirically testable results. Specifi- 
cally, we showed that the smallest polypeptides in a fixed species 
which can mutate (i.e., which can according to Freese’s Hypothesis
undergo a specific alteration in sequence while retaining some
metabolic activity), together with data concerning the total number 
of possible alleles (i.e., non-isomeric DNA molecules of the same 
length), place two separate restrictions on the value of € which 
characterizes the code in question. If it should be found that 
these two restrictions yield inconsistent results (i.e., if a mutable 
polypeptide is found which is of smaller length than that calcu- 
lated from the number of alleles, or conversely if the number of 
alleles of a fixed length is found greater than is allowed from 
considerations of the smallest mutable polypeptide), we would 
have to conclude that Freese’s Hypothesis and the ergodicity of 
the code cannot both be correct. Of course, we could refine the 
above arguments to take account of multiple substitutions, etc., 
but the qualitative conclusions are not altered by these refine- 
ments. On the other hand, if both of these basic assumptions are 
correct, then a new tool is provided for the investigation of the 
general coding problem. 

We shall conclude this section by remarking that, since we have
shown that physiological ranges of mutability place a lower bound
on the values of $\epsilon$ which can occur in quasi-ergodic
DNA-protein codes, we can conclude (cf. Rosen, 1959b, p. 296 et seq.)
that the total number of such codes which can occur in nature is
finite. In fact, we may even be able to estimate this lower bound and
hence arrive at an estimate for the number of possible codes. If
we assign one code per species, or one code per genus, we can in
this manner assign an upper bound for the number of possible
species or genera of organisms utilizing different codes. It may
be interesting to compare such a result with the estimate of
Rashevsky (1954), which was obtained from quite different
considerations.


Ill. Some Further Considerations. We shall now discuss certain 
features of our formulation of the general DNA-protein coding 
problem which are suggested by Freese’s Hypothesis but which 
are only indirectly related to the preceding discussion. We wish 
in particular to derive an expression for the ratio of the total 
number of words of length N belonging to a given quasi-ergodic 
submonoid E(p_i, ε) ⊂ A_n, to the total number of words of length N 
in A_n, and examine the asymptotic behavior of this ratio as N → ∞. 
We shall see that this type of discussion may have some bearing 
on the evolutionary steps leading to the implementation of such a 
code. 


Lemma 1: The number of isomers of a word w ∈ A_n is 

$$\frac{\lambda(w)!}{\prod_{i=1}^{n} \lambda_i(w)!} \tag{6}$$


Proof: Elementary. 
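Lemma 1 is the familiar multinomial count, and it can be checked by direct enumeration for a short word. The following is a minimal sketch only: the function names `isomer_count` and `brute_force_isomers` are introduced here for illustration, and the isomers of w are assumed to be the distinct rearrangements of its letters (w itself included).

```python
from collections import Counter
from itertools import permutations
from math import factorial


def isomer_count(word):
    """Number of isomers of `word` by formula (6): lambda(w)! / prod_i lambda_i(w)!."""
    count = factorial(len(word))
    for occurrences in Counter(word).values():
        # Each partial quotient is an integer, so floor division is exact.
        count //= factorial(occurrences)
    return count


def brute_force_isomers(word):
    """Count distinct rearrangements of `word` by explicit enumeration (short words only)."""
    return len(set(permutations(word)))


example = "aabbbc"  # lambda_1 = 2, lambda_2 = 3, lambda_3 = 1, lambda(w) = 6
formula_value = isomer_count(example)
enumerated_value = brute_force_isomers(example)
```

For this word both counts come to 6!/(2! 3! 1!) = 60.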


Lemma 2: The total number of words (isomeric or not) of length 
N in A_n is n^N. 

Proof: Let S denote the set of the first N integers. Then every 
word of length N in A_n may be regarded as a mapping of S into the 
set {a_1, a_2, ..., a_n} of generators of A_n. By elementary 
set-theoretic considerations, it follows that there are n^N such 
mappings. QED. 
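The mapping argument can be made concrete for small cases. In the sketch below (an illustration only; the generator labels "a", "b", "c" and the helper name `all_words` are assumptions introduced here), each word of length N is produced as one assignment of a generator to every position in S.

```python
from itertools import product


def all_words(generators, N):
    """Every word of length N, viewed as a mapping of positions 1..N into the generators."""
    return ["".join(choice) for choice in product(generators, repeat=N)]


words = all_words("abc", 4)  # n = 3 generators, N = 4
```

Counting the entries of `words` gives 3^4 = 81, in agreement with the lemma.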


On the basis of these considerations, we can write n^N in a 
particularly useful form, which we now proceed to derive. We first 
notice that every word w of length N in A_n is specified up to 
isomerism by the numbers λ_1(w), λ_2(w), ..., λ_n(w), where Σ λ_i(w) = 
λ(w) = N. Thus, the integers {λ_i(w)} form a partition of N into 
precisely n parts. Moreover, each such partition of N determines 
the composition of some word of length N (and all its isomers) in 
A_n. However, each partition of N into n parts carries with it a 
certain "multiplicity," because each such partition may determine 
a number of different (i.e., non-isomeric) words. Thus, the 
partition 20 = 3 + 7 + 1 + 9 may equally well determine the words w, 
w′ specified by 

    λ_1(w) = 3;  λ_2(w) = 7;  λ_3(w) = 1;  λ_4(w) = 9; 
    λ_1(w′) = 7;  λ_2(w′) = 1;  λ_3(w′) = 9;  λ_4(w′) = 3; 

with several other possibilities existing as well. However, it is 
easy to write down the multiplicity associated with a partition 
(i.e., the number of non-isomeric words which correspond to the 
partition in question). 


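Lemma 3: Let N = r_1 + r_2 + ... + r_n be a partition of N into 
exactly n parts. Let ν(r_i) denote the number of times that r_i occurs 
in the partition. Then the multiplicity of this partition is given by 

$$\frac{n!}{\prod_{\text{distinct } r_i} \nu(r_i)!}$$

where the product is taken once over each distinct value occurring 
among the r_i. 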


Proof: This can be established by an argument identical to that 
used in proving Lemma 1 and is likewise left to the reader. 
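As a check on Lemma 3, the multiplicity can be compared with a direct count of the distinct ways of assigning the parts to λ_1, ..., λ_n. This is a sketch under the assumption that the product in the lemma runs once over each distinct part value; the function names are illustrative and not taken from the text.

```python
from collections import Counter
from itertools import permutations
from math import factorial


def multiplicity(parts):
    """n! divided by the product of nu(r)! over the distinct part values r."""
    result = factorial(len(parts))
    for repeats in Counter(parts).values():
        result //= factorial(repeats)
    return result


def distinct_assignments(parts):
    """Distinct ways of assigning the parts to lambda_1, ..., lambda_n, by enumeration."""
    return len(set(permutations(parts)))


all_different = multiplicity((3, 7, 1, 9))  # the partition of 20 used in the text
with_repeats = multiplicity((5, 5, 10))     # a partition of 20 with a repeated part
```

The partition 20 = 3 + 7 + 1 + 9 has four different parts and hence multiplicity 4! = 24, whereas 20 = 5 + 5 + 10 has multiplicity 3!/2! = 3.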


Let x(n, N) denote the number of partitions of N into exactly n 
parts. It is known that the generating function for enumerating 
these partitions is 

$$\frac{1}{(1 - z)(1 - z^2) \cdots (1 - z^n)} \tag{7}$$

(cf. McMahon, 1960). That is, the number x(n, N) is the coefficient 
of the term z^N when (7) is expanded as a power series in z. 
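The coefficients of such a series are easily computed. The sketch below assumes that (7) is the series 1/((1 - z)(1 - z^2)...(1 - z^n)), whose coefficients count partitions of N into n parts when zero parts are admitted (as is needed for words in which some generator does not occur). The function names are introduced here for illustration.

```python
def partition_counts(n, max_total):
    """Coefficients of 1/((1-z)(1-z^2)...(1-z^n)) up to z^max_total."""
    coeffs = [1] + [0] * max_total
    for k in range(1, n + 1):
        # Multiplying by 1/(1 - z^k) is a running sum with stride k.
        for total in range(k, max_total + 1):
            coeffs[total] += coeffs[total - k]
    return coeffs


def count_by_enumeration(n, total, largest=None):
    """Count non-increasing n-tuples of non-negative integers summing to `total`."""
    if largest is None:
        largest = total
    if n == 0:
        return 1 if total == 0 else 0
    return sum(
        count_by_enumeration(n - 1, total - first, first)
        for first in range(min(total, largest), -1, -1)
    )


series = partition_counts(3, 10)  # n = 3, coefficients of z^0 through z^10
```

For n = 3 the coefficient of z^6 is 7, corresponding to the partitions 6+0+0, 5+1+0, 4+2+0, 4+1+1, 3+3+0, 3+2+1 and 2+2+2.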
Given a partition N = r_1 + r_2 + r_3 + ... + r_n, it follows from 
Lemma 1 and Lemma 3 that the total number of words, isomeric or 
not, which corresponds to this partition is given by 

$$\frac{n!}{\prod_{\text{distinct } r_i} \nu(r_i)!} \cdot \frac{N!}{\prod_{i=1}^{n} r_i!} \tag{8}$$

Since every word of length N in A_n arises in the above manner 
from some partition of N into n parts, the total number of these 
words is obtained by summing (8) over all such partitions. Invoking 
Lemma 2, this yields 

$$\sum \frac{n!\, N!}{\prod_{\text{distinct } r_i} \nu(r_i)! \; \prod_{i=1}^{n} r_i!} = n^N. \tag{9}$$
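The identity (9) can be verified numerically for small n and N. The following sketch sums the product of the multiplicity (Lemma 3) and the isomer count (Lemma 1) over all partitions, under two assumptions stated here rather than in the text: parts may be zero, and the factorials ν(r)! are taken once per distinct part value. All function names are illustrative.

```python
from collections import Counter
from math import factorial


def partitions_into_n_parts(total, n, largest=None):
    """Yield non-increasing n-tuples of non-negative integers summing to `total`."""
    if largest is None:
        largest = total
    if n == 0:
        if total == 0:
            yield ()
        return
    for first in range(min(total, largest), -1, -1):
        for rest in partitions_into_n_parts(total - first, n - 1, first):
            yield (first,) + rest


def words_for_partition(parts):
    """Expression (8): multiplicity of the partition times isomers per composition."""
    multiplicity = factorial(len(parts))
    for repeats in Counter(parts).values():
        multiplicity //= factorial(repeats)
    isomers = factorial(sum(parts))
    for r in parts:
        isomers //= factorial(r)
    return multiplicity * isomers


def total_words(n, N):
    """Left-hand side of (9): the sum of (8) over all partitions of N into n parts."""
    return sum(words_for_partition(p) for p in partitions_into_n_parts(N, n))
```

For example, with n = 3 and N = 4 the partitions 4+0+0, 3+1+0, 2+2+0 and 2+1+1 contribute 3, 24, 18 and 36 words respectively, and 3 + 24 + 18 + 36 = 81 = 3^4.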
Now let E(p_i, ε) be a quasi-ergodic submonoid of A_n, and let us 
denote the number of words (isomeric or not) of length N in E(p_i, ε) 
by ρ(N). We desire to examine briefly the behavior of the ratio 
ρ(N)/n^N as N → ∞. To do this, let us rewrite the expression (9) 
by writing each of its terms of the form (8) and writing it explicitly 
as a sum; i.e., in the form 


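$$\frac{N!}{\prod_{i=1}^{n} r_i!} + \frac{N!}{\prod_{i=1}^{n} r_i!} + \cdots + \frac{N!}{\prod_{i=1}^{n} r_i!} \qquad \left( \frac{n!}{\prod_{\text{distinct } r_i} \nu(r_i)!} \ \text{times} \right).$$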

All these terms are then summed as in (9). This new sum has the 
advantage that each of its terms represents the number of isomers 
of a word w of length N in A_n, and moreover each such word 
corresponds uniquely to a specific term in this sum. It can then be 
seen that ρ(N) is given exactly by some partial sum of the terms 
entering into this expression. The number of terms which occur 
in the partial sum which gives ρ(N) depends of course on the 
parameter ε, and in fact is equal to the number of partitions of N 
into n parts for which the ith part r_i falls into the interval I_i 
specified by (2) above. 

At this point, we may at once simplify and clarify our discus- 
sion by pointing out that, by virtue of Theorem 4 of Rosen, 1959b, 
there is no loss of generality in restricting our attention to the 
special case n = 2. In this case, we can write (9) in the form 


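$$2^N = \left[ \frac{N!}{N!\,0!} + \frac{N!}{0!\,N!} \right] + \left[ \frac{N!}{(N-1)!\,1!} + \frac{N!}{1!\,(N-1)!} \right] + \left[ \frac{N!}{(N-2)!\,2!} + \frac{N!}{2!\,(N-2)!} \right] + \cdots + \frac{N!}{(N/2)!\,(N/2)!} \tag{10}$$

or 

$$2^N = \left[ \frac{N!}{N!\,0!} + \frac{N!}{0!\,N!} \right] + \left[ \frac{N!}{(N-1)!\,1!} + \frac{N!}{1!\,(N-1)!} \right] + \cdots + \left[ \frac{N!}{\left(\frac{N-1}{2}\right)!\,\left(\frac{N+1}{2}\right)!} + \frac{N!}{\left(\frac{N+1}{2}\right)!\,\left(\frac{N-1}{2}\right)!} \right] \tag{11}$$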


according as N is even or odd respectively. There are N + 1 
terms in each of (10) and (11). On the other hand, it is 
clear from (2) that approximately 2Nε terms of (10) or (11) will 
give the partial sum corresponding to ρ(N). Now it is easy to see 
intuitively that if ε is very small, then the magnitude of any 2Nε 
terms of (10) or (11) will be negligible in comparison with 2^N. 
Thus, in this case, we have 


$$\lim_{N \to \infty} \frac{\rho(N)}{2^N} = 0. \tag{12}$$

Stated another way, if ε is sufficiently small, then the probability 
that a word picked at random from A_n will belong to E(p_i, ε) is zero. 
On the other hand, if ε is sufficiently large, then the sum of the 
terms comprising ρ(N) will be appreciable in comparison with 2^N. 
In this case, we may write 


$$\lim_{N \to \infty} \frac{\rho(N)}{2^N} = 1. \tag{13}$$

For such ε, we may say that a word picked at random from A_n has 
unit probability of belonging to E(p_i, ε). 
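The two regimes can be explored numerically for n = 2. The sketch below rests on an assumption that should be stressed: since the interval of (2) is not restated in this section, a word with k occurrences of the first generator is here taken to belong to the code when |k/N - p_1| ≤ ε. The function name `code_fraction` and the parameter values are hypothetical and chosen only for illustration.

```python
from math import comb


def code_fraction(N, p1, eps):
    """rho(N) / 2^N for n = 2 under the assumed membership rule |k/N - p1| <= eps.

    k is the number of occurrences of the first generator; the rule is an
    assumption standing in for the interval of (2), which is not restated here.
    """
    admitted = sum(comb(N, k) for k in range(N + 1) if abs(k / N - p1) <= eps)
    return admitted / 2 ** N


# Hypothetical parameters: composition centred on p1 = 0.3, two window widths.
narrow = [code_fraction(N, 0.3, 0.05) for N in (20, 200, 2000)]
wide = [code_fraction(N, 0.3, 0.30) for N in (20, 200, 2000)]
```

With these values the narrow window excludes the composition 1/2, at which the binomial terms are largest, and its fraction shrinks as N grows, while the wide window includes that composition and its fraction approaches 1. This is a numerical illustration for particular parameters only and does not by itself establish the behavior of the limit for general ε.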

It is not at the moment clear whether there is a sharp threshold 
value for €, below which (12) holds and above which (13) holds, or 
whether the limit increases continuously and monotonically from 
0 to 1 with ε. It is clear that a more refined analysis will be 
required to settle this point, and we hope to return to this problem 
in another place. However, the possible biological implications of 
these two alternatives may be discussed. In the first case, we 
have the result that the random formation of a DNA molecule (say) 
will on the average either never or always carry information in a 
specific code E(p_i, ε), depending on the value of the parameter ε 
associated with the code. In the second case, a randomly formed 
molecule may have an appreciable probability of carrying information 
in a specific code, even if the ε associated with that code is 
small. Stated otherwise, the probability that a family of DNA 
molecules could be accumulated at random, satisfying the conditions 
(1) that each molecule of the family carries specific information 
in a single code specified by a submonoid E(p_i, ε), where 
ε is a small number, and (2) that the information carried by the 
family is sufficient to characterize a living organism, is in the 
first case vanishingly small, while in the second case it is finite. 
Such considerations might tend to lead us to favor the second 
possibility; however, it is perhaps of interest to recognize that a 
highly primitive organism such as bacteriophage T4 seems to be 
characterized by an abnormally high value of ε. Moreover, by 
virtue of the connection of the magnitude of ε with the mutability 
of information-bearing words which was developed in the preceding 
section, and the fact that the earliest organisms must have been 
highly mutable, the second possibility also has certain biologi- 
cally attractive features. However, since this question can be 
settled by explicit mathematical calculations, it is perhaps best 
to withhold such speculations until the exact solution is 
forthcoming. 